a single NaCs molecule
from two individual
atoms. The single chemical
reaction takes place
Canada’s call


Next month, Canada will host the Group of 7 (G7)
summit in picturesque Charlevoix, Québec. As 
leaders from Canada, France, Germany, Italy, Ja- 
pan, the United Kingdom, and the United States 
come together, along with European Union rep- 
resentatives, to discuss the progressive agenda, 
science will be on everyone’s mind. With science 
and technology playing a prominent role in everyday 
life, access to science education and to science-based 
careers is ever more essential for inclusive growth and 
for women’s empowerment. 

In addition to the contribution that science makes 
to economic prosperity, the public is demanding that it 
also be used to guide deci- 
sion-making. Scientists and 
knowledge communities have 
risen to strengthen their case 
to policy-makers: Witness 
the global March for Science 
last month and the renewed 
interest in science advice for 
policy at national and inter- 
national levels. Nowhere has 
this trend been more evident 
than in Canada, where the 
government has signaled its 
respect for science-informed 
policy. This was made clear 
in some remarkable devel- 
opments over the past 10 
months, including my ap- 
pointment as the first govern- 
ment chief science adviser in 
over a decade and a historic 
science budget for 2018. The 
Canadian scientific commu- 
nity is generally upbeat and 
reenergized. Now is the time to ensure that the infrastruc- 
ture and programs are in place to meet rising expecta- 
tions and deliver on science promises and potential. 

The G7 summit themes range from climate change 
and ocean sustainability to preparing for the jobs of 
the future while ensuring a peaceful and secure world. 
All of them require quality data, special infrastructure, 
and coherent multi-institutional and multinational ap- 
proaches. Although these issues will occupy the work 
of Canada’s minister of science and the Office of the 
Chief Science Advisor, they are not unique to Canada 
or to any other country. All the more reason, then, that 
the international science community should encourage 
sharing of best practices and the development of universally
accepted principles for research governance.
Using and benefiting from major science initiatives 
and facilities should not be confined within national 
borders. Increased international access to highly spe- 
cialized facilities, such as telescopes, cyclotrons, and 
Arctic research stations, should be supported through 
proper resources and mobility programs. Such efforts 
will promote international collaboration and acceler- 
ate discovery and innovation. This can be a corner- 
stone for international relations. 

Developing policies to meet national and international 
aspirations and commitments requires a well-organized 
science advice ecosystem inside and outside of govern- 
ment. This includes, in ad- 
dition to national science 
advisers, a network of de- 
partmental and subnational 
science advisers, science 
academies, and scientific
associations. Each plays a 
distinct and complementary 
role in gathering evidence, 
convening experts, and com- 
municating with the public, 
decision-makers, and stakeholders.
Strong national institutions are essential for
supporting and sustaining a 
robust ecosystem of science 
advice and public engage- 
ment. With science and tech- 
nology increasingly a part of 
daily life, the need for science 
advice will only intensify. 

“Science knows no country, 
because knowledge belongs 
to humanity,” Pasteur told us.
This is more relevant today than ever before. In the com- 
ing years, scientists will be part of the solutions to global 
challenges, and they will do this collaboratively. Interna- 
tional gatherings, such as those of the G7, the G20, the 
United Nations, and the Commonwealth Heads of Gov- 
ernment, could benefit from advance meetings of science 
advisers, both to help frame the issues and to signal how 
science might be harnessed to solve important problems. 
And Canada, as a member country and one that has made 
science and innovation a priority, is in a good position 
to convene and coordinate such efforts. I look forward to 
working with my colleagues to build on the momentum of 
next month’s summit in Charlevoix. 

-Mona Nemer 


Mona Nemer is 
the chief science 
adviser of Canada. 
science@canada.ca 
FDA approves first migraine preventive
The U.S. Food and Drug Administration (FDA) approved on
17 May the first drug to prevent migraine headaches, an
antibody that blocks the receptor of a protein thought to play a
central role in causing the condition (Science, 8 January 2016,
p. 116). The treatment, erenumab (Aimovig), is made by
Amgen in Thousand Oaks, California, and Novartis of Basel,
Switzerland. The required monthly injections will cost $545 per dose 
in the United States. Migraines, which can be disabling, are estimated 
to afflict 6% of men and 17% of women in the United States. Nearly 
1% of Americans are chronic sufferers, experiencing headaches on 
15 or more days per month. In three clinical trials involving 2199 mi- 
graine sufferers, those who received erenumab instead of a placebo 
experienced 1 to 2.5 fewer migraine days per month. Three other com- 
panies have similar drugs awaiting FDA approval. 
Neil Gemmell of the University of Otago, in Dunedin, New Zealand, on a project to
study environmental DNA from Scotland's Loch Ness for signs of unusual creatures. 


Sweden cancels Elsevier contract 


PUBLISHING | Swedish universities and 
research institutes have canceled their 
contract with academic publishing giant 
Elsevier after failing to reach an agreement 
that aligns with the country’s vision for 
providing free access to all publicly funded 
research by 2026. The Bibsam Consortium, 
which negotiates licensing arrangements
on behalf of Sweden’s universities,
demanded a more “sustainable price 
model.” It estimates that Swedish insti- 
tutes spent €1.3 million in 2017 to publish 
articles in Elsevier’s open-access journals, 
on top of €12 million for accessing the 
publisher’s subscription journals. The 
consortium sought a deal that required 
Elsevier to provide immediate open access 
to all papers by their researchers. The 
cancellation means Swedish universities 
will not have access to Elsevier content 
published after 30 June. 


New head for DOE science 


SCIENCE POLICY | President Donald 
Trump has nominated a neuroscientist and 
former member of the White House staff 
under President Barack Obama to lead
the U.S. Department of Energy’s (DOE’s)
science programs. The White House 
announced on 18 May the nomination of 
Chris Fall, currently principal deputy direc- 
tor of DOE’s Advanced Research Projects 
Agency-Energy (ARPA-E), which helps 
commercialize promising technologies, to 
head the $6 billion DOE Office of Science. 
It is the nation’s leading funder of the 
physical sciences and supports an array of 
facilities used extensively by researchers. 
Prior to joining ARPA-E, Fall spent 6 years 
at the Office of Naval Research in a variety 
of roles, including a 3-year stint working 
on national security issues in Obama’s 
Office of Science and Technology Policy. 
The Senate must confirm Fall for the role. 


Moderate drinking study stopped 


BIOMEDICINE | The U.S. National 
Institutes of Health (NIH) has halted a 
study of the possible benefits of moderate 
drinking while it conducts an ethics and 
scientific review. The Moderate Alcohol
and Cardiovascular Health Trial has come
under scrutiny because the alcoholic beverage
industry is putting up two-thirds of the
funding for the $10 million, 10-year study.
In March, The New York Times reported
on emails and other documents indicating
that NIH employees urged industry
officials to fund the trial, a request that
apparently violated agency policy. At a
Senate hearing last week, NIH Director
Francis Collins said “sufficient concerns”
led the agency to suspend enrollment in
the trial until the investigation is completed
in June. The study, led by Beth
Israel Deaconess Medical Center in Boston,
had already recruited 105 of a planned
7800 participants.


Plants weigh the most, but humans, livestock add up


Plants collectively pack more heft than any other kingdom of life,
even more than bacteria. That's just one of the surprises in a comprehensive
survey of the scientific literature on Earth's biomass.
It also shows that the most diverse groups are not necessarily the
ones with the most biomass, so this measure provides a different
perspective of life on this planet, says Ron Milo, a quantitative biologist
at the Weizmann Institute of Science in Rehovot, Israel, whose student
Yinon Bar-On combed through hundreds of studies. Humans weigh in at
only about 0.06 gigatons of carbon, on par with krill and termites, but
the human impact on biomass since the beginning of civilization
has been huge, Milo's group reports this week in the Proceedings of
the National Academy of Sciences. Humans and their livestock (mostly
cattle and pigs) outweigh wild mammals by more than 22-fold, and
domesticated fowl surpass all other birds in biomass, they found. There
has also been a twofold decline in plant biomass since civilization began.
Uncertainty in numbers for subsurface microbes made estimating their
overall mass difficult, the authors say. The biomass figures set the stage
for their next challenge: estimating the planet’s most abundant protein.

[Figure: biomass of the kingdoms of life and of animal groups, in gigatons of carbon. Values recoverable from the graphic: archaea 8, animals 2, viruses 0.2, arthropods 1, annelids 0.2, molluscs 0.2, cnidarians 0.1, humans 0.06, nematodes 0.02, wild mammals 0.007, wild birds 0.002. Values for bacteria and protists did not survive extraction.]


U.S. science eyes spending boost 


RESEARCH FUNDING | The process of 
setting 2019 funding levels for U.S. sci- 
ence agencies is underway, and so far, 
researchers are mostly pleased with the 
results. This month, Senate and House of
Representatives spending panels began 
advancing appropriations bills that gen- 
erally reject President Donald Trump’s 
requests for cuts to science programs and 
instead specify several increases for the 
fiscal year that begins 1 October. Both 
bodies would give the Department of 
Energy’s science office a roughly 5% boost 
to $6.6 billion. The House bills call for the 
National Science Foundation’s budget to 
grow by 5.2% to $8.2 billion and NASA’s 
science directorate to rise 7.4% to
$6.7 billion. The U.S. Geological Survey
would get a 1.6% boost to $1.2 billion, 
while the Department of Agriculture’s com- 
petitive grants program grows 3.8% to 
$415 million. But the bills cut science 
programs at the Environmental Protection 
Agency by 8.9%, to $644 million, and 
trim research at the National Oceanic and 
Atmospheric Administration (NOAA) by 
8.2%, to $503 million, including a
37.6% cut to NOAA climate science, to
$99 million. The House has yet to prepare
a bill to fund the National Institutes of 
Health. The two chambers ultimately must 
reconcile their spending bills. 


Reprieve for climate program 


CLIMATE SCIENCE | A spending bill 
approved last week by the appropriations
committee of the U.S. House of
Representatives tells NASA to resurrect the 
$10 million Carbon Monitoring System, 
which President Donald Trump’s adminis- 
tration canceled (Science, 11 May, p. 586). 
Climate scientists said the administration’s 
move would jeopardize plans to verify the 
emission cuts that countries agreed to in 
the Paris climate accords. The monitoring 
system has supported the collection of data
about carbon sources and sinks and the creation
of high-resolution models of the planet’s
flows of carbon. The provision is contained
in a House bill that sets 2019 spending 
levels for NASA and other agencies. 
Struggling to serve Iran


Kaveh Madani returned to his native
Iran last September to serve in the
government as deputy vice president for
the environment. Now 36, he had spent
14 years overseas and worked since
2013 as a faculty member at Imperial
College London, where he studied Iran's
dwindling water resources. But in Iran,
hardliners had him interrogated and
accused him of spying. In April, during
a work trip to Bangkok, he resigned
his post and moved to an undisclosed
location outside Iran. He spoke with
Science about his experiences. (An
extended version of this interview is
available at https://scim.ag/IranReturn.)


Q: Why did you return to Iran? 

A: I thought if I succeed, there would be
more trust in scientists and more trust in
people of my generation. And to inspire
other Iranians outside to go back and help.
... In some ways, my time in Tehran was
the best sabbatical one academic could
wish for. This was a wonderful learning
experience. On the other hand, I realize
that I’m lucky I'm not in prison, or dead.


Q: Is there a water crisis in Iran? 

A: Our demand and consumption are 
way higher than the available water. ... 
The government cannot deny the dete- 
rioration of Lake Urmia. So speaking out 
about the environment became a sort of 
safe space [for Iranian citizens]. 


Q: What message were hardliners
trying to send by discrediting you?

A: [They said I] was trying to paint a
dark picture of Iran’s water sources to
justify shutting down the agricultural
sector and importing GMOs [genetically
modified organisms]. They called me a
bioterrorist. I said, “You're misinterpreting
scientific facts.” ... I knew I wasn't
a spy. Getting rid of me was their main
goal. ... Some of the questions I was getting
during my interrogation in Tehran
were the same as what I get at the
borders of some countries in the West.
Hardliners don't like a peaceful environment
because conflict is essential to
their survival, no matter where they are.
I think I was more of a threat because
I was breaking down borders. Trying to
solve things with a scientific approach,
trying to be inclusive.


Insider to lead cancer agency


PUBLIC HEALTH | The controversial 
International Agency for Research on 
Cancer (IARC) last week elected cancer 
epidemiologist Elisabete Weiderpass, a 
member of IARC’s scientific council, as its 
new director. The announcement immedi- 
ately triggered industry calls for Weiderpass 
to change the agency’s allegedly biased 
procedures for rating cancer risks. IARC, an 
arm of the World Health Organization based 
in Lyon, France, has come under fire for 
calling the widely used herbicide glyphosate 
a “probable human carcinogen,” a classification
that Weiderpass has defended but other
agencies disagreed with (Science, 24 June 
2016, p. 1504). Weiderpass, the first woman 
at the agency’s helm, is Brazilian and holds 
Finnish and Swedish passports; she leads 
the research department of the Cancer 
Registry of Norway, in Oslo, as well as the 
genetic epidemiology group at Folkhälsan
Research Center in Finland. 


New ministry for Russian science 


SCIENCE POLICY | In a major restructuring,
the Russian government has created
a new Ministry for Science and Higher 
Education with a new leader. The move 
splits the former Ministry of Education 
and Science, headquartered in Moscow. 
The new science ministry will also oversee 
universities while a new Ministry of 
Education manages primary and second- 
ary schools. Heading the science ministry 
will be Mikhail Kotyukov, a former head
of the controversial Federal Agency for
Scientific Organizations (FASO). Kotyukov, 
who has a background in finance, has 
headed FASO since its creation in the fall 
of 2013. The agency was established as 
part of a reform of the Russian Academy of
Sciences (RAS) and given control over academy
property, a move opposed by scientists and 
widely seen as a power grab by the Russian 
government. Rather than become part
of RAS, as some members had proposed,
FASO will now be absorbed into the science 
ministry, further strengthening the govern- 
ment’s grip on the academy. 


Satellites resume water watch 


EARTH SCIENCE | NASA and a German 
partner launched two satellites on 22
May that will measure trace changes in
Earth’s gravity as a proxy for the move- 
ment of water on the planet, replacing a 
pair that failed in orbit last year. The $550 
million Gravity Recovery and Climate 
Experiment Follow-On (GRACE-FO) mis- 
sion took off from Vandenberg Air Force 
Base in California. Like its predecessors, 
GRACE-FO will gather data by constantly 
monitoring, via microwave pulses, the 
changing distance between its satellites as 
the gravity of large objects below, such as 
ice sheets, first pulls one spacecraft and 
then the other. This technique has provided 
insights into the melting of ice sheets, the 
emptying of groundwater aquifers, and the 
motion of deep ocean currents. 


BIOETHICS 


A forgotten contribution, 
now honored 


By hanging a portrait of her this
month, not far from its depictions 
of presidents, the Smithsonian 
Institution’s National Portrait 
Gallery in Washington, D.C., 
honored a woman whose contributions 
to biomedical research were major, 
ongoing—and without her knowledge or 
consent. Henrietta Lacks, a poor woman 
from rural Virginia, died of cervical can- 
cer in 1951 at age 31. While she was alive, 
doctors removed and biopsied a tumor 
and used it to create what became the 
HeLa line (named for her) of immortal 
cells, capable of renewing themselves in 
cultures indefinitely. Researchers used 
them to develop the polio vaccine and in 
numerous other medical advances. 
Chunks of glass will form one of seven 8-meter mirrors for the Giant Magellan Telescope.


Rival giant telescopes join to seek U.S. funding 


Projects will offer “public access” in bid for support from astronomy community 


By Eric Hand 


Two U.S.-led giant telescope projects,
rivals for nearly 2 decades, announced
this week that they have agreed to
join forces. The Giant Magellan Telescope
(GMT), a 25-meter telescope
under construction in Chile, and the
Thirty Meter Telescope (TMT), which back- 
ers hope to build atop Mauna Kea in Hawaii, 
are still short of partners and money. They 
will now work together to win funding from 
the National Science Foundation (NSF) in 
Alexandria, Virginia, which could help the 
projects catch up to a third giant telescope, 
the 39-meter European Extremely Large 
Telescope (ELT), due to begin operations 
in 2024. It is a historic peace accord to end 
a conflict that has divided funders and de- 
layed both projects. 

“This division has set back U.S. astron- 
omy a decade,” says Richard Ellis, an astron- 
omer at University College London, and a 
former leader of the TMT effort. “Let’s turn 
the corner.” Patrick McCarthy, a GMT vice 
president in Pasadena, California, adds, “It’s 
time for these two projects to come together 
behind a single vision.” 

The partnership, approved by the GMT 
board this month and by the TMT board 
last month, commits the two projects to de- 
veloping a joint plan that would allow as- 
tronomers from any institution to use the 
telescopes; under previous plans observing
time was available only to researchers
from nations or institutions that had pro- 
vided funding. The projects are discussing 
awarding at least 25% of each telescope’s 
time to nonpartners through a competitive 
process to be administered by the National 
Center for Optical-Infrared Astronomy—an 
umbrella organization that will replace the 
National Optical Astronomy Observatory 
(NOAO), based in Tucson, Arizona, some- 
time in fiscal year 2019 (Science, 3 February 
2017, p. 442). 

Telescope backers hope the public ac- 
cess plan will help persuade the federal 
government to pay for at least 25% of the 
total cost of the two facilities, a share that 
could reach $1 billion. (Cost estimates for 
the GMT and the TMT are $1 billion and 
$1.4 billion, respectively, but astronomers
expect both numbers to grow.) “There are 
many science projects that are $1 billion 
class projects,” says David Silva, NOAO’s 
director. “The investment that we would 
want is of a similar size.” 

Advocates for the TMT and the GMT 
“missed opportunities” to unite astrono- 
mers, lawmakers, and funding agencies 
around a single telescope in the early 
2000s, Ellis says. Instead, the projects 
diverged. The TMT began in 2002 under 
the leadership of the California Institute 
of Technology in Pasadena and the University
of California system. To make a
giant mirror that wouldn’t deform under 
its own weight, the TMT’s partners looked 
for inspiration in a fly’s eye, fragment- 
ing the main 30-meter mirror into nearly 
500 small, 1.4-meter hexagonal pieces, 
each individually adjustable. The GMT, ini- 
tially led by the Carnegie Observatories in 
Pasadena, has seven larger segments, each 
8 meters across. They are made of stiff, 
lightweight glass, cast in molds that spin 
as the glass cools to bring it into a para- 
bolic shape. In 2005, the GMT cast the first 
of its mirrors, and the divergent techno- 
logical paths were set. 

More than a decade later, the timing is 
right for rapprochement. First, the U.S. 
decadal survey in astronomy, a once-a- 
decade priority-setting exercise for big 
projects run by the National Academies of 
Sciences, Engineering, and Medicine, is set 
to begin next year, and the public access 
plan could solidify astronomers’ support for 
putting the giant telescopes high on the list. 
In making their case, the teams will argue 
the benefits of having telescopes in both 
the northern and southern hemispheres. 
“When you are covering the whole sky, you 
have greater scientific reach,” says Wendy 
Freedman, an astronomer at The University 
of Chicago in Illinois who was the found- 
ing leader of the GMT. The teams will also 
argue that the telescopes have complementary
strengths. The design of the GMT, for
instance, makes it ideal for a high-resolution 
spectrograph designed to probe the atmo- 
spheres of exoplanets. The TMT, which has 
more light-gathering power, could host a 
multi-object spectrograph to quickly gather 
demographic statistics on the universe's 
first galaxies. 

Uniting the projects could also help over- 
come funding challenges. Each project has 
already amassed support from countries 
and deep-pocketed institutions, but neither 
has enough money to cross the finish line. 
TMT construction has faced costly delays 
because of legal challenges and protests 
by Native Hawaiian groups that object to 
building atop Mauna Kea, which they con- 
sider sacred; planners are considering relo- 
cating the TMT to Spain’s Canary Islands. 
And although the GMT hopes to start ob- 
serving in 2024 with four of its seven mir- 
rors, it, too, needs new partners to see it 
through. The European Southern Observa- 
tory, meanwhile, has made steady progress 
on the ELT, aided by its regular member 
state contributions. “The ELT is real,” Silva
says. “That focused minds.” 

In the end, success will hinge on NSF 
and Congress. Silva says NSF Astronomical 
Sciences Division Director Richard Green, 
as well as Anne Kinney, an astronomer re- 
cently chosen to head the agency’s math- 
ematical and physical sciences directorate, 
“were willing to open the door to explor- 
ing this possibility.” But Kinney says she 
and Green first want to see the results of 
the decadal survey. “Depending on how the 
GMT and TMT come out, we are very inter- 
ested in the large telescopes,” she says. 

A key lawmaker appears to hold simi- 
lar views. Representative John Culberson 
(R-TX), who chairs the House of Represen- 
tatives spending panel that sets funding lev- 
els for NSF, told Science he could envision 
NSF helping pay for the telescopes. “But 
I’m waiting to hear from the next decadal
survey” before taking a definitive position, 
he says. Still, Culberson’s panel last week 
approved a 2019 spending bill that directs 
NSF to “articulate a plan to ensure public 
access in future large optical observatories.” 

Even if NSF ultimately gets money to help 
build the two telescopes, funding for long- 
term operating expenses may be harder 
to find. The agency is already expecting to 
strain to pay running costs for new tele- 
scopes such as the Large Synoptic Survey 
Telescope, an 8.4-meter survey telescope in 
Chile—let alone two much bigger ones. Silva 
says the agency is “really struggling to enable 
leading-edge facilities while maintaining 
healthy grants programs.” 


With additional reporting by Jeffrey Mervis. 
China’s ambitious brain
science project inches forward 


Beijing and Shanghai open new research centers ahead 
of the launch of a 15-year national effort 


By Dennis Normile, in Shanghai, China 


The nascent China Brain Project took
another step toward reality last week
with the launch of the Shanghai Research
Center for Brain Science and
Brain-Inspired Intelligence. The new
center and its Beijing counterpart,
launched 2 months ago, are expected to be- 
come part of an ambitious national effort 
to bring China to the forefront of neuro- 
science. But details of that 15-year project— 
expected to rival similar U.S. and EU ef- 
forts in scale and ambition—are still being 
worked out, 2 years after the government 
made it a priority. 

Preparation for the national effort “was 
taking quite a long time,” says Zhang
Xu, a neuroscientist and executive direc- 
tor of the new center here. So Beijing and 
Shanghai got started on their own plans, 
he says. China’s growing research prow- 
ess and an increasing societal interest in 
neuroscience—triggered in part by an aging 
population—as well as commercial opportu- 
nities and government support are all com- 
ing together to make this “a good time for 
China’s brain science efforts,” Zhang says.

Government planners called for brain 
research to be a key science and technol- 
ogy project in the nation’s 13th Five-Year 
Plan, adopted in spring 2016. The effort 
would have three main pillars, according 
to a November 2016 Neuron paper from 
a group that included Poo Mu-ming, director
of Shanghai’s Institute of Neuroscience (ION),
part of the Chinese Academy of Sciences (CAS).
It would focus on
basic research on neural mechanisms 
underlying cognition, translational studies 
of neurological diseases with an emphasis 
on early diagnosis and intervention, and 
brain simulations to advance artificial in- 
telligence and robotics. Support under the 
5-year plan was just the start of a 15-year 
program, the group wrote. 

The authors saw an opportunity for ba- 
sic research in the “enormous gap in our 
knowledge [of the brain] at the mesoscopic 
level,” where neural circuits connect specific 
types of brain cells. They suggested China 
might make an important contribution by 
mapping those circuits and activity patterns 
using animal models, a plan now called the 
International Mesoscopic Connectome Proj- 
ect. China has developed expertise in study- 
ing the brain using macaques, and animal 
research faces less opposition in China than 
in the West. (Zhang, who is also at ION, ad- 
vocates for a fourth focus, on how education 
in mathematics and language, for example, 
affects neural development.) 

But an official green light is still pending. 
Scientists suggest the complexity of the proj- 
ect and a recent revamp of responsibilities 
at the Ministry of Science and Technology 
have delayed a final decision. “We sincerely 
hope we can see the announcement of the 
launch of this project very soon,” says Yang 
Xiong-Li, a neuroscientist at Fudan University
here. The city of Beijing and local institutions
took the initiative and launched the
Chinese Institute for Brain Research, announced
on 22 March; Beijing is putting up
much of the funding for the institute. It and
the Shanghai center “aim to become world-class
research institutes for brain research,”
says Luo Minmin, a neuroscientist at the
National Institute of Biological Sciences in
Beijing and co-director of what will be the
northern center.

Elderly women in Shanghai, China, suffering
from dementia, one of the diseases the new effort
may tackle.

The Shanghai center will “integrate and 
expand” ongoing efforts, Zhang says. The 
city’s Huashan Hospital, long a center for 
neurosurgery, is already building a new fa- 
cility dedicated to clinical neuroscience, he 
notes; CAS and the city of Shanghai also 
put up money for a brain mapping proj- 
ect in 2012 and a scheme to get area brain 
researchers to collaborate in 2014. The re- 
gion hosts budding commercial efforts to 
develop brain-inspired technologies. Zhang 
estimates some 1 billion yuan ($157 mil- 
lion) annually has been flowing into brain- 
related research here in recent years. 

Although the new center will start re- 
cruiting its own research teams later this 
year, Zhang says, many of its principal in- 
vestigators will have dual appointments 
and keep labs at their current institutes. 
The center will provide a mechanism to 
share pricey equipment and will establish 
a database to gather research and clinical 
information not currently being shared. 
International cooperation will be a priority. 

Once the national effort is underway, the 
two new centers are likely to become its 
northern and southern hubs. Researchers 
expect a high-level committee to set priori- 
ties, but the two centers will retain a de- 
gree of autonomy. In Beijing, for instance, 
“we are encouraged by the Board of Trust- 
ees, which is led by the Beijing municipal 
government, to do whatever we think is 
important,” Luo says. 

As for the overall project, Yang urges a 
step-by-step approach. “Given the ambi- 
tious scope of the project and the relatively 
limited resources in China, I don’t think it 
is realistic to start the project all at once.” 

The next step just might be the launch of 
Poo’s proposed connectome project. It was 
the topic of a 2 May symposium in Beijing 
organized by the science ministry. But as 
for when it might be approved, “I am not 
at liberty to make any statement at this mo- 
ment,” Poo wrote in an email to Science. 


With reporting by Bian Huihui in
Shanghai, China. 
ETHICS


German law allows use of 
DNA to predict suspects’ looks 


Bavaria opens the door to DNA phenotyping, provoking 
debate about the technique’s ethics and accuracy 


By Gretchen Vogel, in Berlin 


Police in the German state of Bavaria
now have the authority to analyze
forensic DNA samples to predict the
geographical ancestry and physical
characteristics (hair color, eye color,
skin color, and age) of an unknown
suspect who poses an imminent danger. The 
controversial law, which passed the Landtag, 
the state parliament in Munich, on 15 May, 
is the first in Germany to allow what has 
been dubbed DNA phenotyping, and it has 
sparked renewed debate here and in other 
countries about the advantages—and risks— 
of such methods. 

German federal authorities, and police in 
the country’s 15 other states, are only allowed 
to perform DNA fingerprinting, in which they 
look for an exact match between crime scene 
DNA and samples in a database of known 
criminals or from a suspect. DNA pheno- 
typing and ancestry prediction, however, 
have been used in the Netherlands, France, 
the United Kingdom, Canada, and several 
U.S. states. Some of those jurisdictions have
explicit laws permitting and regulating the 
practice. In others, police have begun to use 
DNA phenotyping without explicit regula-
tion. Well-publicized success stories, and
advances in the science, have fed interest in
the method. Switzerland, like Germany, does
not allow such use of DNA, but lawmakers 
there are considering reversing that ban. 

Some critics caution that although the 
underlying science connecting genetic mark- 
ers to certain physical features is solid, DNA 
phenotyping and ancestry prediction can still 
be easily misunderstood by police and the 
public. Proponents “exaggerate the numeri- 
cal certainties,” says Veronika Lipphardt of 
the Albert Ludwigs University of Freiburg in 
Germany, who studies the history and uses 
of population genetics. “That creates the im- 
pression that it’s clear-cut what race someone 
is or where someone comes from, and that’s 
not true.” One U.S. company is even using 
DNA phenotyping to create facial sketches 
of suspects, which goes beyond what many 
scientists now consider credible. 

“You would need a lot of training of police 
forces to use [DNA phenotyping] responsi-
bly,” says legal scholar Carsten Momsen of
the Free University of Berlin, who worries 
the technique could lead to the targeting of 
minority groups. Bavaria’s new law doesn’t 
cover investigations of crimes that have al- 
ready been committed; those are still ruled 
by federal law. Instead, it allows DNA pheno- 
typing in situations of “imminent danger,” 
when police suspect someone is planning 
a serious crime. For example, authorities
could analyze trace DNA found on stashes of
weapons or bombmaking materials. The un-
certainties in the technique can be especially
problematic in such high-pressure situations,
Momsen says.

Traces of DNA from items at a crime scene can predict a person’s hair, eye, and skin color, and more, some say.

Geneticist Manfred Kayser of Erasmus 
University Medical Center in Rotterdam, the 
Netherlands, who has developed several of 
the DNA phenotyping techniques permitted 
under the new Bavarian law (Science, 18 Feb- 
ruary 2011, p. 838), acknowledges that “there 
has to be education and training involved. 
You have to be able to work with probabili- 
ties.” Still, Kayser notes that police constantly 
have to decide how trustworthy various kinds 
of evidence are. At least investigators can 
quantify the uncertainty in results from DNA 
analyses, he says. “With eyewitness testimony 
you have no idea if it’s right or wrong.”

The techniques attempt to wring informa- 
tion out of traces of blood, skin, semen, or 
other DNA-containing cells found at a crime 
scene. They read single-nucleotide poly- 
morphisms (SNPs), individual DNA bases 
that can vary between peo- 
ple and directly influence 
or correlate with differences 
in certain physical features. 


their DNA. They can also use age-related chemical
changes in DNA to estimate a person’s age to within about 4 years.

The accuracy of the results depends on
several factors, including the quality (and
quantity) of the sample tested and whether
the genetic variants found in the sample are
well-represented in the model databases.
Kayser and his colleagues in Rotterdam
and at Indiana University-Purdue Univer-
sity Indianapolis recently released their lat-
est iteration of a tool called HIrisPlex-S. The
test returns probabilities for three eye colors
(blue, brown, or green/hazel); red, blond,
brown, or black hair; and five skin shades,
ranging from “very pale” to “dark-black.”

In April, they reported in Forensic Science
International: Genetics that the test correctly
predicted all three characteristics for 17 out
of 19 simulated crime scene samples. (Two
samples did not have high enough quality
DNA to make a prediction.) The test also cor-
rectly identified five out of six other samples
as having cells from two or more people,
although it could not say what they looked
like. Police in the Netherlands and France
have used the test for eye and hair color in
several cases. Ancient DNA researchers have
also used it to predict characteristics of peo-
ple known from archaeological remains.

Going beyond skin, hair, and eye color to
predicting geographical ancestry is harder.
Computer programs can compare a suspect’s
SNP patterns to those in databases from mul-
tiple populations. But some populations may
be missing from the databases and others may
be indistinguishable because of mixing. Tell-
ing whether a person’s ancestors were East
Asian, African, Native American, Western
Eurasian, or from Oceania is fairly straight-
forward. But predicting whether someone
has northern European versus Middle East-
ern ancestry is sometimes impossible.

Still more controversial is Parabon Nano-
Labs in Reston, Virginia, a company that
generates facial shape profiles in addition
to skin, eye, and hair color and geographical
ancestry. Ellen Greytak, its director of bio-
informatics, says the firm starts with a stan-
dard face based on the person’s DNA-derived
ancestry and sex. Further analysis of genetic
markers leads to a face that “emphasizes the
features we predict will be distinctive” about
the person, she says.

That goes too far, Kayser says. “The science
is not there at all.” The

“It’s not going to point to
an individual. It’s going to 
exclude people. ... The vast 
majority of people are not going to fit the 
description.” The company’s website de- 
scribes several cases where its phenotyping 
helped authorities find a culprit. 

Ironically, DNA phenotyping might have
been of little use in the case that helped 
prompt the new Bavarian regulations. In late 
2016, a medical student was raped and mur- 
dered in Freiburg; an asylum seeker, origi- 
nally from Afghanistan, was convicted of the 
crime. But some authorities complained that 
they could have narrowed their search more 
quickly if they had been able to use trace 
DNA to predict what the culprit looked like 
and where they might be from. 

The crucial evidence in the Freiburg mur- 
der case, however, was nongenetic: a strand 
of hair with an unusual dye pattern—dark at 
the root, blond at the ends. Police spotted the 
suspect, who had the same dye pattern, in 
surveillance video from a tram stop near the 
crime scene. A DNA test would have only in- 
dicated dark hair, not enough to pick out the 
suspect in the video footage. Kayser agrees 
that the case wasn’t well suited to argue for 
a law to allow DNA phenotyping. “It was the 
wrong case to make that claim,” he says. 
B612 plans asteroid hunt with
fleet of small satellites


After abandoning plans 
for a large space telescope, 
private foundation pins 
hopes on new technologies 


By Adam Mann 


Last week, an asteroid the size of
Egypt’s Great Pyramid of Giza 
whizzed by Earth, missing it by half 
the distance to the moon. The con- 
cern that we may one day not be so 
lucky has long preoccupied the B612 
Foundation, a private organization in Mill 
Valley, California, dedicated to finding as- 
teroids that cross Earth’s orbit and could 
devastate humanity. B612 itself had a near- 
death experience 3 years ago, when its 
bold plans for an asteroid-hunting space 
telescope fell apart. But now, its ambitions 
are rising again with a new technique for 
finding menacing objects. 

On the hunt

Congress has tasked NASA with finding 90% of near-
Earth asteroids bigger than 140 meters. Most of the
biggest are known. Organizations like the B612 Foun-
dation are trying to make progress on smaller ones.

On 10 May, B612 announced a partner-
ship with York Space Systems, a Denver-
based maker of standard 85-kilogram
satellites, to investigate building a fleet of 
small asteroid hunters. For many years, 
B612—which takes its name from the as- 
teroid home of Antoine de Saint-Exupéry’s 
Little Prince—aimed to build and launch 
a much larger craft, Sentinel, a $450 mil- 
lion space telescope with a 50-centimeter 
mirror. In 2012, NASA agreed to provide 
logistical support. But fundraising stalled 
and, in 2015, the agency ended its agree- 
ment with B612 because it wasn’t meeting 
mileposts, essentially killing the telescope. 
Now, B612 has developed a new technique 
to do the same thing at a far lower cost 
with small space telescopes. Ed Lu, B612’s 
co-founder, expects the first telescope to 
cost about $10 million and believes a full 
constellation “would be a factor of many, 
many cheaper” than Sentinel. 

Some asteroid astronomers are skeptical 
of the new approach, saying the technol- 
ogy is far from proven. “To be very, very 
blunt, what they are proposing and what 
they’ve demonstrated is not going to help 
us find more NEOs [near-Earth objects],” 
says astronomer Timothy Spahr, CEO of 
space consultancy firm NEO Sciences in 
Marlborough, Massachusetts, who does in- 
dependent work for NASA. 

Astronomers believe they have already 
discovered 95% of the NEOs of civilization- 
ending size, a kilometer or more across. 
But ones between 140 meters and 1 kilo- 
meter across—still big enough to wipe out 
a city—are so faint that an estimated 59% 
remain undiscovered. Finding them, and 
perhaps developing plans for diverting 
threats, requires a big telescope, like Sen- 
tinel, or long exposures on a smaller one. 
But if the asteroids are moving fast, they 
shift from pixel to pixel on the telescope’s 
camera, and the long exposures provide 
little benefit. 

In 2013, astronomer Michael Shao of the 
Jet Propulsion Laboratory (JPL) in Pasa- 
dena, California, and his colleagues out- 
lined a technique called synthetic tracking, 
which gets around this issue by taking up 
to 100 snapshots in a single second. A dim 
asteroid won’t appear in any particular 
image, but by arranging and stacking the 
photos in thousands of combinations, the 
system can get lucky and preferentially 
brighten an unknown fast-moving object. 
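
The stacking idea described above can be illustrated with a toy "shift-and-stack" search. This is only a minimal sketch under stated assumptions, not Shao's or B612's actual pipeline: the frame size, noise level, integer pixel velocities, and the function names `make_frames`, `shift_and_stack`, and `search` are all hypothetical choices made for illustration.

```python
import random

def make_frames(n_frames, width, height, start, velocity, amplitude, seed=0):
    """Simulate short exposures: unit Gaussian noise plus one faint moving point."""
    rng = random.Random(seed)
    x0, y0 = start
    vx, vy = velocity
    frames = []
    for t in range(n_frames):
        frame = [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(height)]
        frame[y0 + vy * t][x0 + vx * t] += amplitude  # object moves one step per frame
        frames.append(frame)
    return frames

def shift_and_stack(frames, vx, vy):
    """Sum the frames after undoing an assumed motion of (vx, vy) pixels per frame."""
    height, width = len(frames[0]), len(frames[0][0])
    stack = [[0.0] * width for _ in range(height)]
    for t, frame in enumerate(frames):
        for y in range(height):
            src_y = y + vy * t
            if not 0 <= src_y < height:
                continue  # shifted pixel falls outside this frame
            row, out = frame[src_y], stack[y]
            for x in range(width):
                src_x = x + vx * t
                if 0 <= src_x < width:
                    out[x] += row[src_x]
    return stack

def search(frames, max_speed):
    """Try every integer velocity up to max_speed; keep the brightest stacked pixel."""
    best = None
    for vx in range(-max_speed, max_speed + 1):
        for vy in range(-max_speed, max_speed + 1):
            stack = shift_and_stack(frames, vx, vy)
            for y, row in enumerate(stack):
                for x, value in enumerate(row):
                    if best is None or value > best[0]:
                        best = (value, (vx, vy), (x, y))
    return best  # (stacked brightness, velocity, starting position)
```

In this sketch the object is only twice as bright as the per-pixel noise, so it does not stand out in any single frame. Only the stack built with the correct trial velocity adds its light coherently, which is why the search has to try many combinations.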

Doing this for an unknown asteroid of 
unknown brightness and speed at an un- 
known point in the sky is computationally 
demanding. But Shao says advanced pro- 
cessing power makes it feasible. He adds 
that the technique would help a small tele- 
scope do as good a job as a bigger one, even 
though it gathers less light. It could also 
help uncover the fastest moving asteroids,
such as the object ‘Oumuamua, an inter-
stellar asteroid spotted zipping through
the solar system last year.

The B612 Foundation is partnering with York Space Systems to build a fleet of small asteroid-spotting satellites.

Shao and his team simulated how nine 
20-centimeter telescopes in orbit around 
the sun would fare with the technique. In a 
2015 study, they projected that the constel- 
lation could discover 90% of Earth-grazing 
asteroids 50 meters or larger in diameter 
in 5 years, three times faster than other 
techniques and faster than Sentinel itself. 
Shao says within a week of publishing his 
paper, B612 contacted him. 

Last year, B612 tested the technique in 
space by asking the Earth-observing com- 
pany Planet to turn one of its SkySat sat- 
ellites around and aim its 35-centimeter 
telescope at already known asteroids. By 
applying synthetic tracking, the team could 
tease out images of the faint objects. 

Now, B612 needs to raise money for its 
new plan. But the asteroid-hunting field is 
becoming more crowded. Starting in 2021, 
the Large Synoptic Survey Telescope, an 
enormous 8.4-meter observatory in Chile, 
will photograph large parts of the sky each 
night and could increase the number of 
known NEOs from about 18,000 to more 
than 100,000. NASA would also like to 
build the Near-Earth Object Camera (NEO- 
Cam), a space-based mission that would 
replicate many of Sentinel’s goals and 
abilities. Although the mission was not 
selected in a competition last year, it was 
given money for continued study, a sign 
that it might one day fly. 

Lindley Johnson, NASA’s planetary de- 
fense officer in Washington, D.C., says 
NEOCam could launch 4 years after it’s 
given the go-ahead, and that it would
likely be better at finding asteroids than
B612’s constellation. He says synthetic
tracking is good for detecting fast-moving
objects near Earth, like debris and satel-
lites, but won’t work as well for faint and
distant objects. “I think they are over-
optimistic in their assessments,” he says.
Shao, however, says experts at JPL have as- 
sured him that small satellites can provide 
the power, computation, and communica- 
tions needed for synthetic asteroid tracking 
to work. 

Mark Sykes, CEO and director of the 
Planetary Science Institute in Tucson, Ari- 
zona, says any of these asteroid missions 
could pay off for research as well as for 
planetary protection. Spotting asteroids 
passing closer than the moon, like the 
15 May object, would allow researchers 
to explore at close range objects that hail 
from the distant solar system. “We would 
have a one-stop shopping center for sam- 
pling material that has formed throughout 
the solar system,” he says, adding that as- 
teroid mining companies would also be in- 
terested in such near-Earth targets. 

Lu shares those hopes. He wants B612 to 
compile nearby asteroids in a user-friendly 
catalog, like a Google Maps for the solar 
system—a step that he says would help 
future scientists, explorers, and miners. 
“If you buy into the idea that humans will
someday live and work and economically 
operate in space, then you need a map,” 
he says. 


Adam Mann is a journalist in 
Oakland, California. 
INFECTIOUS DISEASES


A call to arms against the other retrovirus 


HTLV-1, discovered just before HIV but almost forgotten, infects millions and causes cancer 


By Kai Kupferschmidt 


The discovery of the first human retro-
virus in 1980 was a minor scien-
tific sensation. Researchers knew
retroviruses, which transcribe their
RNA genome into DNA and integrate
it into a host cell’s genome, existed in
animals. But until Robert Gallo, then at the 
National Cancer Institute in Bethesda, Mary- 
land, found the human T-cell leukemia virus-1 
(HTLV-1), some doubted that retroviruses in- 
fect humans. HTLV-1 was soon eclipsed, how- 
ever, by another retrovirus that would go on 
to kill more than 35 million people and keep 
generations of scientists busy: HIV. “If you 
were working on retroviruses you switched 
to HIV,” says epidemiologist Antoine Gessain 
of the Pasteur Institute in Paris. 


But in a 10 May open letter to World 
Health Organization (WHO) Director-
General Tedros Adhanom Ghebreyesus,
Gallo and 59 other virologists, epidemi- 
ologists, and patient advocates call for a 
global effort to eradicate HTLV-1, which 
infects millions and causes cancer and sev- 
eral other diseases. Just like HIV, HTLV-1 
spreads through blood, semen, and breast 
milk, and the letter argues that the testing 
and prevention strategies used against HIV 
should be employed to stop HTLV-1. 

In some aboriginal communities in Australia, almost half the people over age 50 are infected with HTLV-1.

HTLV-1 is nowhere to be found among
the many dozens of diseases on fact sheets
on WHO’s website; it’s not recognized as
a neglected tropical disease and isn’t on 
WHO’s list of sexually transmitted diseases
either. “I still scratch my head about how we 
managed to stay on the margins of HIV and 
of sexual health,” says Graham Taylor, who 
studies HTLV-1 at Imperial College London. 

The virus probably jumped from mon- 
keys to humans somewhere in Africa tens 
of thousands of years ago; the slave trade 
helped spread it around the world. Today, 
scientists estimate that 5 million to 10 mil- 
lion people are infected, but the real num- 
ber is probably much higher, Taylor says. 
One hot spot is southern Japan, where up to 
6% of pregnant women have been found to 
carry the virus. But poor countries are most 
affected, which partly explains the neglect, 
Gallo says. In Jamaica and other Caribbean
islands, about 6% of the entire population
may carry the virus. Parts of Brazil have 
high rates, millions are infected in Africa, 
and in some aboriginal communities in 
Australia, almost half the people over age 
50 are infected, a recent study found. Eu- 
rope and the United States can’t rest easy, 
Taylor says: “If we are not careful, this could 
become more prevalent in our communities 
without us being aware. ... As other infec- 
tions like HIV come under control, people’s 
behavior may change.” 

Most people infected with HTLV-1 never 
have symptoms, but about 3% to 5% de-
velop adult T-cell leukemia, a cancer that
kills patients after 8 to 10 months on aver- 
age and for which treatment has not im- 
proved in the past quarter-century. Another 
4% develop tropical spastic paraparesis, a 
disease similar to multiple sclerosis. Other 
inflammatory diseases and immune defi- 
ciency have also been reported, and an Aus- 
tralian study published in March showed 
that those infected with HTLV-1 are more 
likely to develop bronchiectasis, a disease 
in which parts of the airways are enlarged, 
and to die from it. 

Yet HTLV-1’s modest death toll, and the 
decades that often go by between infection 
and disease, have kept it out of the limelight. 
In response to the letter, a WHO spokes- 
person pointed out that another virus, 
hepatitis B, causes more than 400,000 can- 
cers each year worldwide. In contrast, the 
International Agency for Research on Can- 
cer estimates that only 3000 cancer diagno- 
ses annually are directly linked to HTLV-1. 
Taylor thinks the real number is higher, but 
he says, “I really don’t think this should turn 
into a ‘my disease is more important than 
your disease’ competition.” 

Routine testing for HTLV-1 should be 
available in sexual health clinics everywhere, 
the researchers write in the letter, and moth- 
ers in endemic regions should routinely 
be screened for HTLV-1 and advised not to 
breastfeed if they are infected. Since ante- 
natal testing was introduced in Japan’s Na- 
gasaki region in 1987, the infection rate in 
the population has dropped from 7.2% to 
1%. The letter also calls for universal HTLV-1 
testing of blood and organ donors, because 
transplants have transmitted the virus, albeit 
rarely. “I honestly believe that nobody should 
have a transplant where the donor has not 
been tested for HTLV-1,” Taylor says. (Until 
2009, the United States did screen organ do- 
nors, but the practice was stopped because 
false positive results were disqualifying too 
many healthy organs.) 

Gallo believes renewed attention to 
HTLV-1 could have broader benefits. In the
early 1980s, the virus sped up the discovery 
of HIV, he says, because it alerted scientists 
to the possibility that a retrovirus might 
be causing the mysterious new syndrome 
called AIDS. Today, a better understanding 
of HTLV-1’s powerful carcinogenic proper- 
ties may lead researchers to new insights 
about cancer, Gallo says. 
BASIC INSTINCTS


Some say artificial intelligence needs
to learn like a child


By Matthew Hutson 


It’s a Saturday morning in February,
and Chloe, a curious 3-year-old in a 
striped shirt and leggings, is exploring 
the possibilities of a new toy. Her father, 
Gary Marcus, a developmental cogni- 
tive scientist at New York University 
(NYU) in New York City, has brought 
home some strips of tape designed to 
adhere Lego bricks to surfaces. Chloe, 
well-versed in Lego, is intrigued. But she has 
always built upward. Could she use the tape 
to build sideways or upside down? Marcus 
suggests building out from the side of a ta- 
ble. Ten minutes later, Chloe starts sticking 
the tape to the wall. “We better do it before 
Mama comes back,” Marcus says in a sing- 
song voice. “She won’t be happy.” (Spoiler: 
The wall paint suffers.) 

Implicit in Marcus’s endeavor is an ex- 
periment. Could Chloe apply what she had 
learned about an activity to a new context? 
Within minutes, she has a Lego sculpture 
sticking out from the wall. “Papa, I did it!” 
she exclaims. In her adaptability, Chloe is 
demonstrating common sense, a kind of 
intelligence that, so far, computer scien- 
tists have struggled to reproduce. Marcus 
believes the field of artificial intelligence 
(AI) would do well to learn lessons from 
young thinkers like her. 

Researchers in machine learning ar- 
gue that computers trained on mountains 
of data can learn just about anything— 
including common sense—with few, if any, 
programmed rules. These experts “have a 
blind spot, in my opinion,” Marcus says. “It’s 
a sociological thing, a form of physics envy,
where people think that simpler is better.” He 
says computer scientists are ignoring decades 
of work in the cognitive sciences and devel- 
opmental psychology showing that humans 
have innate abilities—programmed instincts 
that appear at birth or in early childhood— 
that help us think abstractly and flexibly, like 
Chloe. He believes AI researchers ought to in- 
clude such instincts in their programs. 

Yet many computer scientists, riding high 
on the successes of machine learning, are 
eagerly exploring the limits of what a naive 
AI can do. “Most machine learning people, 
I think, have a methodological bias against 
putting in large amounts of background 
knowledge because in some sense we view 
that as a failure,” says Thomas Dietterich, a 
computer scientist at Oregon State University 
in Corvallis. He adds that computer scientists 
also appreciate simplicity and have an aver- 
sion to debugging complex code. Big compa- 
nies such as Facebook and Google are another 
factor pushing AI in that direction, says Josh 
Tenenbaum, a psychologist at the Massachu- 
setts Institute of Technology (MIT) in Cam- 
bridge. Those companies are most interested 
in narrowly defined, near-term problems, 
such as web search and facial recognition, in 
which blank-slate AI systems can be trained 
on vast data sets and work remarkably well. 

But in the longer term, computer scien-
tists expect AIs to take on much tougher
tasks that require flexibility and common
sense. They want to create chatbots that ex-
plain the news, autonomous taxis that can
handle chaotic city traffic, and robots that
nurse the elderly. “If we want to build robots
that can actually interact in the full human
world like C-3PO,” Tenenbaum says, “we’re
going to need to solve all of these problems
in much more general settings.”

Babies are born with instincts that help us learn
common sense, so far elusive for AI algorithms.

Some computer scientists are already try- 
ing. In February, MIT launched Intelligence 
Quest, a research initiative now raising hun- 
dreds of millions of dollars to understand 
human intelligence in engineering terms. 
Such efforts, researchers hope, will result in 
AIs that sit somewhere between pure ma-
chine learning and pure instinct. They will
boot up following some embedded rules, but
will also learn as they go. “In some sense this
is like the age-old nature-nurture debate,
now translated into engineering terms,”
Tenenbaum says. 

Part of the quest will be to discover what 
babies know and when, lessons that can
then be applied to machines. That will take 
time, says Oren Etzioni, CEO of the Allen 
Institute for Artificial Intelligence (AI2) in 
Seattle, Washington. AI2 recently announced 
a $125 million effort to develop and test com- 
mon sense in AI. “We would love to build on 
the representational structure innate in the 
human brain,” Etzioni says, “but we don’t
understand how the brain processes lan- 
guage, reasoning, and knowledge.” 

Ultimately, Tenenbaum says, “We're trying 
to take one of the oldest dreams of AI seri-
ously: that you could build a machine that
grows into intelligence the way a human
does, that starts like a baby and learns like
a child.”


IN THE PAST FEW YEARS, AI has shown that it 
can translate speech, diagnose cancer, and 
beat humans at poker. But for every win, 
there is a blunder. Image recognition algo- 
rithms can now distinguish dog breeds bet- 
ter than you can, yet they sometimes mistake 
a chihuahua for a blueberry muffin. AIs can
play classic Atari video games such as Space 
Invaders with superhuman skill, but when 
you remove all the aliens but one, the AI 
falters inexplicably. 

Machine learning—one type of AI—is 
responsible for those successes and failures. 
Broadly, AI has moved from software that 
relies on many programmed rules 
(also known as Good Old-Fashioned 
AI, or GOFAI) to systems that learn
through trial and error. Machine 
learning has taken off thanks to 
powerful computers, big data, and 
advances in algorithms called neu- 
ral networks. Those networks are 
collections of simple computing 
elements, loosely modeled on neu- 
rons in the brain, that create stron- 
ger or weaker links as they ingest 
training data. 

With its Alpha programs, Google’s 
DeepMind has pushed deep learn- 
ing to its apotheosis. Each time 
rules were subtracted, the soft- 
ware seemed to improve. In 2016, 
AlphaGo beat a human champion at 
Go, a classic Chinese strategy game. 
The next year, AlphaGo Zero easily 
beat AlphaGo with far fewer guide- 
lines. Months later, an even simpler system 
called AlphaZero beat AlphaGo Zero—and 
also mastered chess. In 1997, a classic, rule- 
based AI, IBM’s Deep Blue, had defeated 
chess champion Garry Kasparov. But it turns 
out that true chess virtuosity lies in know- 
ing the exceptions to the exceptions to the 
exceptions—information best gleaned 
through experience. AlphaZero, which learns 
by playing itself over and over, can beat Deep 
Blue, today’s best chess programs, and every 
human champion. 

Yet systems such as Alpha clearly are not 
extracting the lessons that lead to common 
sense. To play Go on a 21-by-21 board in- 
stead of the standard 19-by-19 board, the AI 
would have to learn the game anew. In the 
late 1990s, Marcus trained a network to take 
an input number and spit it back out—about 
the simplest task imaginable. But he trained 
it only on even numbers. When tested with 
odd numbers, the network floundered. It 
couldn’t apply learning from one domain to 
another, the way Chloe had when she began
to build her Lego sideways. 
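
A toy version of that even-number experiment shows why the failure happens. The sketch below is a much simpler stand-in, not Marcus's actual network: it assumes 4-bit binary inputs, a single layer of sigmoid units with zero-initialized weights, and plain gradient descent, and the function names are hypothetical.

```python
import math

N_BITS = 4

def to_bits(n):
    """Least-significant bit first, so index 0 is the odd/even bit."""
    return [(n >> i) & 1 for i in range(N_BITS)]

def from_bits(bits):
    return sum(b << i for i, b in enumerate(bits))

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(numbers, steps=2000, lr=1.0):
    """Full-batch gradient descent on one layer of sigmoid units, starting from zero."""
    weights = [[0.0] * N_BITS for _ in range(N_BITS)]  # weights[output][input]
    biases = [0.0] * N_BITS
    data = [to_bits(n) for n in numbers]
    for _ in range(steps):
        grad_w = [[0.0] * N_BITS for _ in range(N_BITS)]
        grad_b = [0.0] * N_BITS
        for x in data:
            for j in range(N_BITS):
                out = sigmoid(biases[j] + sum(w * xi for w, xi in zip(weights[j], x)))
                err = out - x[j]  # the target is the input itself (identity task)
                grad_b[j] += err
                for i in range(N_BITS):
                    grad_w[j][i] += err * x[i]  # zero whenever input bit i is 0
        for j in range(N_BITS):
            biases[j] -= lr * grad_b[j] / len(data)
            for i in range(N_BITS):
                weights[j][i] -= lr * grad_w[j][i] / len(data)
    return weights, biases

def predict(model, n):
    """Threshold each output unit and decode the bits back into a number."""
    weights, biases = model
    x = to_bits(n)
    bits = [1 if biases[j] + sum(w * xi for w, xi in zip(weights[j], x)) > 0 else 0
            for j in range(N_BITS)]
    return from_bits(bits)
```

Trained on the even numbers 0 through 14, this model reproduces every even input, but it maps each odd number to the even number just below it (7 comes back as 6). The lowest bit was always 0 during training, so nothing ever taught the model to switch it on.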

The answer is not to go back to rule-based 
GOFAIs. A child does not recognize a dog with 
explicit rules such as “if number of legs=4, 
and tail=true, and size>cat.” Recognition is
more nuanced—a chihuahua with three legs 
won't slip past a 3-year-old. Humans are not 
blank slates, nor are we hardwired. Instead, 
the evidence suggests we have predisposi- 
tions that help us learn and reason about 
the world. Nature doesn’t give us a library of 
skills, just the scaffolding to build one. 
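
Written out as code, the explicit rule quoted above shows its brittleness directly. This is only an illustrative sketch: the `Animal` fields, the size threshold, and the example measurements are hypothetical values chosen for the demonstration.

```python
from dataclasses import dataclass

@dataclass
class Animal:
    name: str
    legs: int
    has_tail: bool
    size_cm: float  # shoulder height; illustrative numbers only

CAT_SIZE_CM = 25.0  # hypothetical stand-in for "size > cat"

def is_dog_by_rule(animal):
    """The hand-written rule: four legs, a tail, and bigger than a cat."""
    return animal.legs == 4 and animal.has_tail and animal.size_cm > CAT_SIZE_CM

labrador = Animal("labrador", 4, True, 57.0)
chihuahua = Animal("three-legged chihuahua", 3, True, 18.0)
sheep = Animal("sheep", 4, True, 80.0)

for animal in (labrador, chihuahua, sheep):
    print(animal.name, is_dog_by_rule(animal))
```

The rule accepts the labrador, rejects the three-legged chihuahua that a 3-year-old would recognize, and also accepts the sheep, so it fails in both directions.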

In a triumph of machine learning, AlphaGo beat Go champion Ke Jie in 2017.

Harvard University psychologist Elizabeth
Spelke has argued that we have at least four
“core knowledge” systems giving us a head
start on understanding objects, actions,
numbers, and space. We are intuitive physi-
cists, for example, quick to understand ob-
jects and their interactions. According to one
study, infants just 3 days old interpret the 
two ends of a partially hidden rod as parts 
of one entity—a sign that our brains might 
be predisposed to perceive cohesive objects. 
We're also intuitive psychologists. In a 2017 
Science study, Shari Liu, a graduate student 
in Spelke’s lab, found that 10-month-old 
infants could infer that when an animated 
character climbs a bigger hill to reach one 
shape than to reach another, the character 
must prefer the former. Marcus has shown 
that 7-month-old infants can learn rules; 
they show surprise when three-word sen- 
tences (“wo fe fe”) break the grammatical 
pattern of previously heard sentences (“ga 
ti ga”). According to later research, day-old 
newborns showed similar behavior. 

Marcus has composed a minimum list of 
10 human instincts that he believes should 
be baked into AIs, including notions of cau-
sality, cost-benefit analysis, and types versus
instances (dog versus my dog). Last October 
at NYU, he argued for his list in a debate on
whether AI needs “more innate machinery,” 
facing Yann LeCun, an NYU computer scien- 
tist and Facebook’s chief AI scientist. To dem- 
onstrate his case for instinct, Marcus showed 
a slide of baby ibexes descending a cliff. “They 
don’t get to do million-trial learning,” he said. 
“If they make a mistake, it’s a problem.”

LeCun, disagreeing with many develop- 
mental psychologists, argued that babies 
might be learning such abilities within days, 
and if so, machine learning algorithms could, 
too. His faith comes from experience. He 
works on image recognition, and in the 1980s 
he began arguing that hand-coded algorithms 
to identify features in pictures would become 
unnecessary. Thirty years later, he was proved 
right. Critics asked him: “Why learn it when 
you can build it?” His reply: Building is hard, 
and if you don’t fully understand 
how something works, the rules you 
devise are likely to be wrong. 

But Marcus pointed out that 
LeCun himself had embedded 
one of the 10 key instincts into his 
image-recognition algorithms: 
translational invariance, the abil- 
ity to recognize an object no mat- 
ter where it appears in the visual 
field. Translational invariance is the 
principle behind convolutional neu- 
ral networks, or convnets, LeCun’s 
greatest claim to fame. In the past 
5 years they've become central to 
image recognition and other AI ap- 
plications, kicking off the current 
craze for deep learning. 
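
A minimal sketch can show how a convnet builds in translational invariance. It is not LeCun's code: it assumes a single hand-picked 3-by-3 filter, a tiny 8-by-8 binary image, and hypothetical helper names. One shared filter slides across the image, so its response simply moves with the object, and taking the maximum over all positions then gives the same score wherever the object sits.

```python
CROSS = [[0, 1, 0],
         [1, 1, 1],
         [0, 1, 0]]

def place(pattern, top, left, size=8):
    """Draw the pattern on a blank size-by-size image at the given offset."""
    image = [[0] * size for _ in range(size)]
    for i, row in enumerate(pattern):
        for j, value in enumerate(row):
            image[top + i][left + j] = value
    return image

def correlate2d(image, kernel):
    """Slide one shared kernel over every position where it fits entirely."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[y + i][x + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for x in range(out_w)]
            for y in range(out_h)]

def global_max(feature_map):
    """Pool over all positions: keeps how strongly the filter fired, not where."""
    return max(max(row) for row in feature_map)

def argmax(feature_map):
    """Return the (row, column) where the filter fired most strongly."""
    best = max((value, (y, x))
               for y, row in enumerate(feature_map)
               for x, value in enumerate(row))
    return best[1]

top_left = correlate2d(place(CROSS, 0, 0), CROSS)
shifted = correlate2d(place(CROSS, 4, 3), CROSS)
print(global_max(top_left), global_max(shifted))  # same score in both images
print(argmax(top_left), argmax(shifted))          # the location tracks the shift
```

Because the filter weights are shared across positions, nothing has to be relearned when the object moves. That built-in assumption is the "instinct" Marcus is pointing to.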

LeCun tells Science that transla- 
tional invariance, too, could eventu- 
ally emerge on its own with better 
general learning mechanisms. “A 
lot of those items will kind of spontaneously 
pop up as a consequence of learning how 
the world works,” he says. Geoffrey Hinton, 
a pioneer of deep learning at the University 
of Toronto in Canada, agrees. “Most of the 
people who believe in strong innate knowl- 
edge have an unfounded belief that it’s hard 
to learn billions of parameters from scratch,” 
he says. “I think recent progress in deep 
learning has shown that it is actually surpris- 
ingly easy.” 

The debate over where to situate an AI 
on a spectrum between pure learning and 
pure instinct will continue. But that issue 
overlooks a more practical concern: how to 
design and code such a blended machine. 
How to combine machine learning—and 
its billions of neural network parameters— 
with rules and logic isn’t clear. Neither is 
how to identify the most important in- 
stincts and encode them flexibly. But that 
hasn’t stopped some researchers and com- 
panies from trying. 

A ROBOTICS LABORATORY at The University 
of New South Wales in Sydney, Australia, 
is dressed to look like a living room and 
kitchen—complete with bottles of James 
Boag’s Premium Lager in the fridge. Com- 
puter scientist Michael Thielscher explains 
that the lab is a testbed for a domestic robot. 
His team is trying to endow a Toyota Human 
Support Robot (HSR), which has one arm 
and a screen for a face, with two humanlike 
instincts. First, they hope to program the 
HSR to decompose challenges into smaller, 
easier problems, just as a person would parse 
a recipe into several steps. Second, they want 
to give the robot the ability to reason about 
beliefs and goals, the way humans instinc- 
tively think about the minds of others. How 
would the HSR respond if a person asked it 
to fetch a red cup, and it found only a blue 
cup and a red plate? 

So far, their software shows some humanlike abilities, including
the good sense to fetch the blue cup rather than the red plate.
But more rules are programmed into the system than Thielscher
would like. His team has had to tell their AI that cup is usually
more important than red. Ideally, a robot would have the social
instincts to quickly learn people’s preferences on its own.

Other researchers are working to inject their AIs with the same
intuitive physics that babies seem to be born with. Computer
scientists at DeepMind in London have developed what they call
interaction networks. They incorporate an assumption about the
physical world: that discrete objects exist and have distinctive
interactions. Just as infants quickly parse the world into
interacting entities, those systems readily learn objects’
properties and relationships. Their results suggest that
interaction networks can predict the behavior of falling strings
and balls bouncing in a box far more accurately than a generic
neural network.

Vicarious, a robotics software company in San Francisco,
California, is taking the idea further with what it calls schema
networks. Those systems, too, assume the existence of objects and
interactions, but they also try to infer the causality that
connects them. By learning over time, the company’s software can
plan backward from desired outcomes, as people do. (I want my nose
to stop itching; scratching it will probably help.) The
researchers compared their method with a state-of-the-art neural
network on the Atari game Breakout, in which the player slides a
paddle to deflect a ball and knock out bricks. Because the schema
network could learn about causal relationships—such as the fact
that the ball knocks out bricks on contact no matter its
velocity—it didn’t need extra training when the game was altered.
You could move the target bricks or make the player juggle three
balls, and the schema network still aced the game. The other
network flailed.

Besides our innate abilities, humans also benefit from something
most AIs don’t have: a body. To help software reason about the
world, Vicarious is “embodying” it so it can explore virtual
environments, just as a baby might learn something about gravity
by toppling a set of blocks. In February, Vicarious presented a
system that looked for bounded regions in 2D scenes by essentially
having a tiny virtual character traverse the terrain. As it
explored, the system learned the concept of containment, which
helps it make sense of new scenes faster than a standard
image-recognition convnet that passively surveyed each scene in
full. Concepts—knowledge that applies to many situations—are
crucial for common sense. “In robotics it’s extremely important
that the robot be able to reason about new situations,” says
Dileep George, a co-founder of Vicarious. Later this year, the
company will pilot test its software in warehouses and factories,
where it will help robots pick up, assemble, and paint objects
before packaging and shipping them.

Different minds
Over time, artificial intelligence (AI) has shifted from
algorithms that rely on programmed rules and logic—instincts—to
machine learning, where algorithms contain few rules and ingest
training data to learn by trial and error. Human minds sit
somewhere in the middle.

INSTINCT to LEARNING
Rule-based AI: IBM’s Deep Blue, which bested chess champion Garry
Kasparov in 1997, relied on rules and logic.
Humans: Babies learn by trial and error. But developmental
cognitive scientists say we also begin life with basic instincts
that help us gain a flexible common sense.
Machine learning AI: With few programmed rules, DeepMind’s
AlphaZero today can beat IBM’s Deep Blue at chess. But it doesn’t
generalize.

One of the most challenging tasks is to code instincts flexibly,
so that AIs can cope with a chaotic world that does not always
follow the rules. Autonomous cars, for example, cannot count on
other drivers to obey traffic laws. To deal with that
unpredictability, Noah Goodman, a psychologist and computer
scientist at Stanford University in Palo Alto, California, helps
develop probabilistic programming languages (PPLs). He describes
them as combining the rigid structures of computer code with the
mathematics of probability, echoing the way people can follow
logic but also allow for uncertainty: If the grass is wet it
probably rained—but maybe someone turned on a sprinkler.
Crucially, a PPL can be combined with deep learning networks to
incorporate extensive learning. While working at Uber, Goodman and
others invented such a “deep PPL,” called Pyro. The ride-share
company is exploring uses for Pyro such as dispatching drivers and
adaptively planning routes amid road construction and game days.
Goodman says PPLs can reason not only about physics and logistics,
but also about how people communicate, coping with tricky forms of
expression such as hyperbole, irony, and sarcasm.

CHLOE MIGHT NOT MASTER sarcasm until her teen years, but her
inborn knack for language is already clear. At one point in
Marcus’s apartment, she holds out a pair of stuck Lego bricks.
“Papa, can you untach this for me?” Her father obliges without
correcting her coinage. Words and ideas are like Lego pieces,
their parts readily mixed and matched, and eagerly tested in the
world.

After Chloe tires of building on the wall, an older, slightly more
seasoned intelligent system gets a chance to try it: her brother
Alexander, age 5. He quickly constructs a Lego building that
protrudes farther. “You can see the roots of what he’s doing in
what she did,” Marcus says. When asked, Alexander estimates how
far the structure might stick out before it would fall. “He’s
pretty well-calibrated,” Marcus observes. “He hasn’t had 10
million trials of glued-on-the-wall Lego things in order to assess
the structural integrity. He’s taking what he knows about physics,
and so forth, and making some inferences.”

Marcus is obviously proud, not only of his offspring’s
capabilities, but also that they uphold his theories of how we
learn about the world—and how AIs should be learning, too. Done
with their Lego buildings, Chloe and Alexander leap into their
father’s arms. They squeal with delight as he spins them around
and around, offering them another chance to fine-tune their
intuitive senses of physics—and fun.


Matthew Hutson is a journalist based in 
New York City. 
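
A worked version of the wet-grass reasoning that Goodman uses to describe probabilistic programming may help. The sketch below is plain Python, not Pyro, and every probability in it is an assumed, illustrative number rather than a figure from the article. It writes the "rules" as an ordinary function, attaches probabilities to them, and then asks what probably happened given that the grass is wet.

```python
from itertools import product

# Assumed prior beliefs (illustrative numbers only).
P_RAIN = 0.3
P_SPRINKLER = 0.1

def p_wet(rain, sprinkler):
    """Assumed chance that the grass is wet for each combination of causes."""
    if rain and sprinkler:
        return 0.99
    if rain:
        return 0.90
    if sprinkler:
        return 0.80
    return 0.01

def posterior_given_wet():
    """Enumerate every possible world and weigh it by how well it explains wet grass."""
    joint = {}
    for rain, sprinkler in product([True, False], repeat=2):
        prior = (P_RAIN if rain else 1 - P_RAIN) * \
                (P_SPRINKLER if sprinkler else 1 - P_SPRINKLER)
        joint[(rain, sprinkler)] = prior * p_wet(rain, sprinkler)
    evidence = sum(joint.values())
    p_rain = sum(p for (rain, _), p in joint.items() if rain) / evidence
    p_sprinkler = sum(p for (_, spr), p in joint.items() if spr) / evidence
    return p_rain, p_sprinkler

p_rain, p_sprinkler = posterior_given_wet()
print(f"P(rain | wet grass)      = {p_rain:.2f}")
print(f"P(sprinkler | wet grass) = {p_sprinkler:.2f}")
```

With these assumed numbers, rain comes out as the likely explanation while the sprinkler remains a live possibility, which is the pattern of reasoning the article describes. A PPL automates this kind of inference for models far too large to enumerate by hand.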
Wheat sensitivity isn’t imaginary, 
most researchers now agree. But 
what's really behind it? 


By Kelly Servick 


The patients weren’t crazy—Knut Lundin was sure of that. But 
their ailment was a mystery. They were convinced gluten was 
making them sick. Yet they didn’t have celiac disease, an auto- 
immune reaction to that often-villainized tangle of proteins in 
wheat, barley, and rye. And they tested negative for a wheat al- 
lergy. They occupied a medical no man’s land. 

About a decade ago, gastroenterologists like Lundin, based 
at the University of Oslo, came across more and more of those 
enigmatic cases. “I worked with celiac disease and gluten for so 
many years,” he says, “and then came this wave.” Gluten-free choices began 
appearing on restaurant menus and creeping onto grocery store shelves. 
By 2014, in the United States alone, an estimated 3 million people without 
celiac disease had sworn off gluten. It was easy to assume that people 
claiming to be “gluten sensitive” had just been roped into a food fad. 
“Generally, the reaction of the gastroenterologist [was] to say, ‘You 
don’t have celiac disease or wheat allergy. Goodbye,’” says Armin Alaedini, 
an immunologist at Columbia University. “A lot of people thought this is 
perhaps due to some other [food] sensitivity, or it’s in people’s heads.” 

Stretchy, resilient gluten allows bread to rise. But millions now shun it. 

But a small community of researchers 
started searching for a link between wheat 
components and patients’ symptoms— 
commonly abdominal pain, bloating, and 
diarrhea, and sometimes headaches, fa- 
tigue, rashes, and joint pain. That wheat re- 
ally can make nonceliac patients sick is now 
widely accepted. But that’s about as far as 
the agreement goes. 

As data trickle in, entrenched camps 
have emerged. Some researchers are con- 
vinced that many patients have an immune 
reaction to gluten or another substance in 
wheat—a nebulous illness sometimes called 
nonceliac gluten sensitivity (NCGS). 

Others believe most patients are actually 
reacting to an excess of poorly absorbed 
carbohydrates present in wheat and many 
other foods. Those carbohydrates—called 
FODMAPs, for fermentable oligosaccha- 
rides, disaccharides, monosaccharides, and 
polyols—can cause bloating when they fer- 
ment in the gut. If FODMAPs are the primary 
culprit, thousands of people may be on glu- 
ten-free diets with the support of their doc- 
tors and dietitians but without good reason. 

Those competing theories were on dis- 
play in a session on wheat sensitivity at a 
celiac disease symposium held at Columbia 
in March. In back-to-back talks, Lundin 
made the case for FODMAPs, and Alaedini 
for an immune reaction. But in an irony that 
underscores how muddled the field has be- 
come, both researchers started their quests 
believing something completely different. 


KNOWN WHEAT-RELATED ILLNESSES have 
clear mechanisms and markers. People with 
celiac disease are genetically predisposed to 
launch a self-destructive immune response 
when a component of gluten called gliadin 
penetrates their intestinal lining and sets 
off inflammatory cells in the tissue below. 
People with a wheat allergy respond to 
wheat proteins by churning out a class of 
antibodies called immunoglobulin E that 
can set off vomiting, itching, and shortness 
of breath. The puzzle, for both doctors and 
researchers, is patients who lack both the 
telltale antibodies and the visible damage 
to their intestines but who feel real relief 
when they cut out gluten-containing food. 
Some doctors have begun to approve 
and even recommend a gluten-free diet. 
“Ultimately, we’re here not to do science, 
but to improve quality of life,” says Alessio 
Fasano, a pediatric gastroenterologist at 
Massachusetts General Hospital in Boston 
who has studied NCGS and written a book 
on living gluten-free. “If I have to throw 
bones on the ground and look at the moon 
to make somebody better, even if I don’t 
understand what that means, I’ll do it.” 

Like many doctors, Lundin believed that 
(fad dieters and superstitious eaters aside) 
some patients have a real wheat-related 
ailment. His group helped dispel the no- 
tion that NCGS was purely psychosomatic. 
They surveyed patients for unusual levels 
of psychological distress that might express 
itself as physical symptoms. But the sur- 
veys showed no differences between those 
patients and people with celiac disease, the 
team reported in 2012. As Lundin bluntly 
puts it: “We know they are not crazy.” 

Still, skeptics worried that the field had 
seized on gluten with shaky evidence that 
it was the culprit. After all, nobody eats glu- 
ten in isolation. “If we did not know about 
the specific role of gluten in celiac dis- 
ease, we would never have thought gluten 
was responsible for [NCGS],” says Stefano 
Guandalini, a pediatric gastroenterologist 
at the University of Chicago Medical Center 
in Illinois. “Why blame gluten?” 

Against the grain 
Data from the National Health and Nutrition Examination Survey show the rising tide of gluten 
avoidance by people without celiac disease. Celiac diagnoses also rose, but the disease’s 
actual prevalence probably did not. 

Defenders of NCGS generally acknowledge 
that other components of wheat might 
contribute to symptoms. In 2012, a group of 
proteins in wheat, rye, and barley called am- 
ylase trypsin inhibitors emerged as a poten- 
tial offender, for example, after a team led 
by biochemist Detlef Schuppan of Johannes 
Gutenberg University Mainz in Germany 
(then at Harvard Medical School in Boston) 
reported that those proteins can provoke 
immune cells. 

But without biological markers to iden- 
tify people with NCGS, researchers have 
relied on self-reported symptoms measured 
through a “gluten challenge”: Patients rate 
how they feel before and after cutting out 
gluten. Then doctors reintroduce gluten 
or a placebo—ideally disguised in indistin- 
guishable pills or snacks—to see whether 
the symptoms tick back up. 

Alaedini has recently hit on a more ob- 
jective set of possible biological markers— 
much to his own surprise. “I entered this 
completely as a skeptic,” he says. Over his 
career, he has gravitated toward studying 
spectrum disorders, in which diverse symp- 
toms have yet to be united under a clear 
biological cause—and where public mis- 
information abounds. His team published a 
study in 2013, for example, that debunked 
the popular suggestion that children with 
autism had high rates of Lyme disease. “I 
do studies [where] there is a void,” he says. 

In NCGS, Alaedini saw another poorly 
defined spectrum disorder. He did accept 
that patients without celiac disease might 
somehow be sensitive to wheat, on the basis 
of several trials that measured symptoms 
after a blinded challenge. But he was not 
convinced by previous studies claiming that 
NCGS patients were more likely than other 
people to have certain antibodies to glia- 
din. Many of those studies lacked a healthy 
control group, he says, and relied on com- 
mercial antibody kits that gave murky and 
inconsistent readings. 

In 2012, he contacted researchers at the 
University of Bologna in Italy to obtain 
blood samples from 80 patients their team 
had identified as gluten sensitive on the ba- 
sis of a gluten challenge. He wanted to test 
the samples for signs of a unique immune 
response—a set of signaling molecules dif- 
ferent from those in the blood of healthy 
volunteers and celiac patients. He wasn’t 
optimistic. “I thought if we were going to 
see something, like with a lot of spectrum 
conditions that I have looked at, we would 
see small differences.” 

The results shocked him. Compared with 
both healthy people and those with celiac, 
these patients had significantly higher lev- 
els of a certain class of antibodies against 
gluten that suggest a short-lived, systemic 
immune response. That didn’t mean gluten 
itself was causing disease, but the finding 
hinted that the barrier of those patients’ 
intestines might be defective, allowing 
partially digested gluten to get out of the 
gut and interact with immune cells in the 
blood. Other elements—such as immune 
response-provoking bacteria—also might 
be escaping. Sure enough, the team found 
elevated levels of two proteins that indi- 
cate an inflammatory response to bacteria. 
And when 20 of the same patients spent 
6 months on a gluten-free diet, their blood 
levels of those markers declined. 

For Alaedini, the beginnings of a mech- 
anism emerged: Some still-unidentified 
wheat component prompts the intestinal 
lining to become more permeable. (An im- 
balance in gut microbes might be a predis- 
posing factor.) Components of bacteria then 
seem to sneak past immune cells in the 
underlying intestinal tissue and make their 
way to the bloodstream and liver, prompt- 
ing inflammation. 

“This is a real condition, and there can 
be objective, biological markers for it,” 
Alaedini says. “That study changed a lot of 
minds, including my own.” 

The study also impressed Guandalini, a 
longtime skeptic about the role of gluten. It 
“opens the way to finally reach an identifi- 
able marker for this condition,” he says. 


BUT OTHERS SEE the immune-response ex- 
planation as a red herring. To them, the 
primary villain is FODMAPs. The term, 
coined by gastroenterologist Peter Gibson 
at Monash University in Melbourne, Aus- 
tralia, and his team, encompasses a smor- 
gasbord of common foods. Onions and 
garlic; legumes; milk and yogurt; and fruits 
including apples, cherries, and mangoes are 
all high in FODMAPs. So is wheat: Carbs 
in wheat called fructans can account for as 
much as half of a person’s FODMAP intake, 
dietitians in Gibson’s group have estimated. 
The team found that those compounds fer- 
ment in the gut to cause symptoms of ir- 
ritable bowel syndrome, such as abdominal 
pain, bloating, and gas. 

Gibson has long been skeptical of stud- 
ies implicating gluten in such symptoms, 
arguing that those findings are hopelessly 
clouded by the nocebo effect, in which the 
mere expectation of swallowing the dreaded 
ingredient worsens symptoms. His team 
found that most patients couldn’t reliably 
distinguish pure gluten from a placebo in a 
blinded test. He believes that many people 
feel better after eliminating wheat not be- 
cause they have calmed some intricate im- 
mune reaction, but because they’ve reduced 
their intake of FODMAPs. 

Lundin, who was firmly in the immune- 
reaction camp, didn’t believe that FOD- 
MAPs could explain away all his patients. 
“I wanted to show that Peter was wrong,” 
he says. During a 2-week sabbatical in the 
Monash lab, he found some quinoa-based 
snack bars designed to disguise the taste 
and texture of ingredients. “I said, ‘We’re 
going to take those muesli bars and we're 
going to do the perfect study.’” 

His team recruited 59 people on self- 
instituted gluten-free diets and randomized 
them to receive one of three indistinguish- 
able snack bars, containing isolated gluten, 
isolated FODMAP (fructan), or neither. After 
eating one type of bar daily for a week, they 
reported any symptoms. Then they waited 
for symptoms to resolve and started on a dif- 
ferent bar until they had tested all three. 

Before analyzing patient responses, 
Lundin was confident that gluten would 
cause the worst symptoms. But when the 
study’s blind was lifted, only the FODMAP 
symptoms even cleared the bar for statistical 
significance. Twenty-four of the 59 patients 
had their highest symptom scores after a 
week of the fructan-laced bars. Twenty-two 
responded most to the placebo, and just 
13 to gluten, Lundin and his collaborators— 
who included Gibson—reported last Novem- 
ber in the journal Gastroenterology. Lundin 
now believes FODMAPs explain the symp- 
toms in most wheat-avoiding patients. “My 
main reason for doing that study was to find 
out a good method of finding gluten-sensitive 
individuals,” he says. “And there were none. 
And that was quite amazing.” 
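
The reported counts are easier to compare as shares of the 59 participants. The short Python tally below uses only the numbers given above. It is descriptive arithmetic, not the study's statistical analysis, and the dictionary labels are chosen here for illustration.

```python
# Number of participants whose highest symptom score followed each bar,
# as reported in the text above.
counts = {"fructan (FODMAP)": 24, "placebo": 22, "gluten": 13}

total = sum(counts.values())
shares = {bar: n / total for bar, n in counts.items()}

for bar, share in shares.items():
    print(f"{bar}: {counts[bar]}/{total} = {share:.1%}")

# Which bar was most often associated with the worst week.
worst_bar = max(shares, key=shares.get)
```

Seen this way, the placebo bar was the worst week for nearly as many participants as the fructan bar, while gluten trailed both.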


AT THE COLUMBIA MEETING, Alaedini and 
Lundin went head to head in consecutive 
talks titled “It’s the Wheat” and “It’s FOD- 
MAPS.” Each has a list of criticisms of the 
other’s study. Alaedini contends that by recruiting 
broadly from the gluten-free population, 
instead of finding patients who reacted 
to wheat in a challenge, Lundin likely failed 
to include people with a true wheat sensitiv- 
ity. Very few of Lundin’s subjects reported 
symptoms outside the intestines, such as 
rash or fatigue, that might point to a wide- 
spread immune condition, Alaedini says. And 
he notes that the increase in patients’ symp- 
toms in response to the FODMAP snacks was 
just barely statistically significant. 

Lundin, meanwhile, points out that 
the patients in Alaedini’s study didn’t 
go through a blinded challenge to check 
whether the immune markers he identified 
really spiked in response to wheat or gluten. 
The markers may not be specific to people 
with a wheat sensitivity, Lundin says. 

Although consumers focus on gluten, other wheat components could be at the root of symptoms. 

Despite the adversarial titles of their 
talks, the two researchers find a lot of common 
ground. Alaedini agrees that FODMAPs 
explain some of the wheat-avoidance 
phenomenon. And Lundin acknowledges 
that some small population may really have 
an immune reaction to gluten or another 
component of wheat, though he sees no 
good way to find them. 

After the meeting, Elena Verdú, a 
gastroenterologist at McMaster University 
in Hamilton, Canada, puzzled over the polarization 
of the field. “I don’t understand 
why there is this need to be so dogmatic 
about ‘it is this, it is not that,’” she says. 

She worries that the scientific confu- 
sion breeds skepticism toward people 
who avoid gluten for medical reasons. 
When she dines with celiac patients, she 
says, waiters sometimes meet requests for 
gluten-free food with smirks and ques- 
tions. Meanwhile, the conflicting messages 
may send nonceliac patients down a food- 
avoidance rabbit hole. “Patients are with- 
drawing gluten first, then lactose, and then 
FODMAPs—and then they are on a really, 
really poor diet,” she says. 

But Verdú believes careful research will 
ultimately break through the superstitions. 
She is president of the North American So- 
ciety for the Study of Celiac Disease, which 
this year awarded its first grant to study 
nonceliac wheat sensitivity. She’s hopeful 
that the search for biomarkers like those 
Alaedini has proposed will show that inside 
the monolith of gluten avoidance lurk mul- 
tiple, nuanced conditions. “It will be diffi- 
cult,” she says, “but we are getting closer.” 

White storks migrate in large flocks 
between their breeding sites in Europe and 
their overwintering grounds in Africa. 


Following the leader, for better or worse 


Juvenile white storks use different behavioral strategies for thermal soaring 


By Gabrielle A. Nevitt 


White storks (Ciconia ciconia) are 
highly social birds that migrate 
thousands of kilometers each year 
between breeding locations in Eu- 
rope and overwintering grounds 
in Africa. The birds migrate dur- 
ing the day in large, seemingly disorganized 
flocks and rely on thermals to power their 
flight (1). On page 911 of this issue, Flack et 
al. (2) provide a glimpse into the behavioral 
interactions that white storks use to exploit 
atmospheric thermals in a migratory flock. 

Thermals form when air over Earth’s sur- 
face heats up during daylight hours. Warm air 
rises because it is less dense, and the resulting 
thermal provides a source of vertical lift that 
soaring birds can exploit (1). Soaring birds 
like white storks have evolved adaptations, 
such as broad stiff wings, that enable them 
to soar on thermals like a glider, thus allow- 
ing them to travel more efficiently than they 
could by flapping flight alone (3). However, 
because thermals are not always predictable, 
the challenge for white storks is to find ap- 
propriate thermals that they can use to gain 
sufficient altitude to power a long glide. 

During their first migration, juveniles 
and adults migrate together to overwinter- 
ing sites, but it has been unclear whether 
individuals work collectively to exploit ther- 
mals to migrate these great distances (4, 5). 
Given the technical challenge of simultane- 
ously tracking individuals within flocks, a 
first step in addressing this question is to 
examine how juveniles interact with each 
other as they take off for their first migra- 
tion shortly after fledging. 

During the summer of 2014, Flack et al. followed a flock of 27
fledgling white storks from a small colony of 22 nests as they
migrated for the first time from their breeding colony in southern
Germany to Spain. Birds were tagged with devices that allowed the
researchers to simultaneously monitor each individual’s
geographical position, elevation, speed, and direction. The birds
were also equipped with three-axis accelerometers to monitor how
frequently they were flapping their wings. The researchers focused
on the first 5 days after the fledglings started their migration.

The researchers identified three modes of flight—flapping flight,
thermalling, and gliding—and also determined how frequently birds
flapped their wings. To further examine interactions between
individuals as they located and exploited thermals, the
researchers developed a precise method to measure the time
required for a tagged bird to reach the position of another tagged
bird. Thus, this study provides a high-resolution picture of how
white storks interact with each other during their first
migration.

The data suggest that how individuals find and exploit thermals is
related to an underlying initial variation in flight performance.
Juvenile white storks with the lowest proportion of wing-flapping
activity tended to fly ahead of birds that flapped their wings
more. The individuals in the lead were able to find thermals and
circle to higher altitudes than the birds that made up the rear of
the flock; instead of searching for thermals on their own, these
birds followed the leaders directly to them. Once the followers
arrived at the thermal, they could climb rapidly, but they were
also inclined to leave the thermals prematurely at a lower
altitude, presumably owing to a drive to keep up with the rest of
the flock (see the figure).

The leader-follower model for exploiting thermals
Juvenile white storks migrate in flocks, but individuals vary in
their flight performance. Leaders search for thermals, whereas
followers find thermals by watching the leaders. Leaders start
their glide from a higher altitude than followers, presumably
because their departure prompts the others to follow them.
Followers cannot migrate as far as leaders because they must
expend more energy on flapping flight.
1 The storks at the front of the flock locate the thermal and
begin to climb.
2 The flock climbs together until the leaders reach the top of the
thermal.
3 The following birds leave the thermal at a lower altitude to
catch up with the leaders.

Flack et al. were also able to examine how far birds traveled
during the first 4 weeks of migration. Migratory distance was
strongly correlated with the birds’ initial wing-flapping
behavior. Birds that flew ahead in the flock and needed to flap
their wings less migrated farther than birds that followed and
that needed to flap their wings more.

The authors suggest that differences in flight performance in
juvenile storks lead to different behavioral strategies for
exploiting thermals, giving individuals that are leaders an edge
in how far they get on their first migration. However, the authors
could not explain these differences by body mass, sex, or other
demographic parameters that they recorded. This begs the question
as to whether the leader-follower strategy might be context
dependent. The study focused on a single flock for just 5 days,
but there is a natural variation in flight performance among a
cohort of fledglings, and a follower in one flock could drop out
to be a leader in another flock of more evenly matched
individuals.

The researchers’ findings illustrate how dynamic flocking behavior
is. They initially tagged all 61 fledglings at the colony, but the
study group was reduced to just 27 birds because some individuals
left the focal flock, stopped migrating altogether, or died. In
addition, the tagged juveniles were joined by unmonitored
individuals. As the authors point out, interactions with these
untagged individuals could not be assessed.

Further research might explore whether leader-follower roles
change within the flocks themselves. Other migratory species that
are adapted to travel in V-formation to conserve energy (6) have
been shown to frequently change position in a pairwise manner,
such that birds spend similar amounts of time leading and
following (7). In juvenile Northern bald ibis (Geronticus
eremita), the V-formation gives birds in the rear an aerodynamic
advantage, but, by frequently exchanging positions, that advantage
is more evenly distributed among the flock members (7).

An added intriguing wrinkle in this story is that juvenile white
storks frequently migrate with adults, which would seem to impose
a handicap on juveniles. Juvenile white storks rely on flapping
flight more than adults, and this higher energetic cost of trying
to keep up is thought to contribute to juvenile morbidity during
the first migration (5). One might also predict that
more-experienced birds should position themselves in leader
positions in mixed-aged flocks, not only because they are stronger
fliers, but also because they know where the flock needs to go
(8). However, this might put younger birds at a disadvantage
because there is an energetic cost to being a follower if they are
constantly trying to keep up with the faster leaders. Comparisons
of age-related performance in the use of thermals by Eurasian
griffon vultures (Gyps fulvus) suggest that juveniles do not use
thermals as efficiently under intermediate wind shear (4). It will
be intriguing to explore how thermal soaring contributes to
organized movement of individuals within mixed-aged migratory
flocks.

Enhancing energy transport 
in conjugated polymers 


Scalable synthesis enables long-distance exciton transport 
in crystalline polymer nanofibers 


By Russell J. Holmes 


The conversion of light into usable 
chemical energy by plants is enabled 
by the precise spatial arrangement of 
light-absorbing photosynthetic sys- 
tems and associated molecular com- 
plexes (1). In organic solar cells, there 
is also the need to control intermolecular 
spacing and molecular orientation, as well 
as thin-film crystallinity and morphology, 
so as to enable efficient energy migration 
and photoconversion (2). In an organic so- 
lar cell, light absorption creates excitons, 
tightly bound electron-hole pairs that must 
be efficiently dissociated into their compo- 
nent charge carriers in order to create an 
electrical current. Thus, long-range exciton 
migration must occur from the point of 
photogeneration to a dissociating site. On 
page 897 of this issue, Jin et al. (3) report on
a conjugated polymer nanofiber system that
yields exciton diffusion lengths greater than 
200 nm. In comparison, organic solar cells 
are typically constructed with materials 
having exciton diffusion lengths one order 
of magnitude smaller than this value, which 
limits device thickness and optical absorp- 
tion. Their approach exploits a sequential 
synthesis method that enables measure- 
ment of this long exciton diffusion length 
(see the figure). 

[Figure: Probing excitons. Jin et al. studied long-range exciton transport in a nanofiber core surrounded by either an insulating,
nonquenching corona or a quenching corona that can accept excitons from the core. Light excites the nanofiber core.
Labels: nanofiber core, insulating corona, quenching corona, excitons, quencher photoluminescence, nanofiber core photoluminescence.
With a fully insulating corona, excitons migrate over distances exceeding 200 nm.
The exciton diffusion length can be extracted by reducing the insulating corona segment length.]

854 25 MAY 2018 • VOL 360 ISSUE 6391

Exciton diffusion is typically considered
to occur by means of two mechanisms
(4). The first is through direct electron-
exchange between nearest neighbors, also
known as Dexter transfer. Because this
mechanism involves the physical exchange
of electrons, it requires the associated do-
nor and acceptor molecules to have over-
lapping electronic orbitals. The second
mechanism for energy transfer occurs via
the nonradiative coupling of electronic
dipoles, termed Förster transfer. Förster
transfer contrasts with Dexter transfer in
that it may occur through occupied space,
given that it is mediated by the electro-
magnetic field. Because Förster transfer
fundamentally requires that the donor
molecule be capable of light emission, dark
states are precluded from this mechanism
and instead migrate exclusively via Dexter
transfer. In highly luminescent materi-
als, the rate of Förster transfer is typically
faster than the rate of Dexter transfer and
is responsible for transport.
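
The different distance dependence of the two mechanisms can be illustrated with a rough numerical sketch. It uses the standard textbook forms, a Förster rate scaling as (R0/r)^6 and a Dexter rate falling off exponentially with separation. The lifetime, Förster radius, and Dexter parameters below are illustrative assumptions, not values from Jin et al.

```python
import math

def forster_rate(r_nm, lifetime_ns=1.0, forster_radius_nm=3.0):
    """Forster transfer rate in 1/ns: (1/tau) * (R0 / r)**6.

    lifetime_ns (tau) and forster_radius_nm (R0) are assumed values.
    """
    return (1.0 / lifetime_ns) * (forster_radius_nm / r_nm) ** 6

def dexter_rate(r_nm, prefactor_per_ns=100.0, orbital_radius_nm=0.1):
    """Dexter transfer rate in 1/ns: K * exp(-2 r / L).

    prefactor_per_ns (K) and orbital_radius_nm (L) are assumed values.
    """
    return prefactor_per_ns * math.exp(-2.0 * r_nm / orbital_radius_nm)

# Compare the two rates at a few donor-acceptor separations.
for r in (0.5, 1.0, 2.0, 4.0):
    print(f"r = {r:.1f} nm  Forster = {forster_rate(r):.3e}/ns  "
          f"Dexter = {dexter_rate(r):.3e}/ns")
```

With these assumed parameters the exponential Dexter term collapses within about a nanometer, whereas the power-law Förster term remains appreciable at several nanometers.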

Jin et al. probe exciton migration in 
crystalline, luminescent nanofibers of 
poly(di-n-hexylfluorene) (PDHF) using 
photoluminescence quenching. At its core, 
this method involves probing the photolu- 
minescence intensity or decay kinetics from 
a material of interest subject to an exciton 
quencher placed at a known distance from 
the photogeneration site. Measurement of 
the photoluminescence as the distance to 
the quencher is varied allows the exciton 
diffusion length to be determined. In order 
to measure the exciton diffusion length of 
PDHF, nanofibers were synthesized from 
segments that have coronae consisting of 
either poly(ethylene glycol) (PEG) or quat- 
ernized polythiophene (QPT). 
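
The idea of extracting a diffusion length from distance-dependent quenching can be sketched with a deliberately simple model. It assumes one-dimensional diffusion toward a perfect quencher, for which the probability that an exciton created a distance d away reaches the quencher before decaying is exp(-d / L_D). This is a toy model on synthetic data, not the analysis used by Jin et al.; the function names and numbers are hypothetical.

```python
import math

def reach_probability(distance_nm, diffusion_length_nm):
    """Probability of reaching a perfect quencher distance_nm away
    before decaying, for 1D diffusion: exp(-d / L_D)."""
    return math.exp(-distance_nm / diffusion_length_nm)

def fit_diffusion_length(distances_nm, quenched_fractions):
    """Least-squares line through ln(fraction) versus distance.

    The slope of that line is -1 / L_D, so L_D = -1 / slope.
    """
    n = len(distances_nm)
    logs = [math.log(f) for f in quenched_fractions]
    mean_x = sum(distances_nm) / n
    mean_y = sum(logs) / n
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(distances_nm, logs))
    sxx = sum((x - mean_x) ** 2 for x in distances_nm)
    return -sxx / sxy

# Synthetic, noise-free "measurements" generated with an assumed L_D.
assumed_ld_nm = 200.0
distances = [50.0, 100.0, 200.0, 400.0]
fractions = [reach_probability(d, assumed_ld_nm) for d in distances]

fitted_ld_nm = fit_diffusion_length(distances, fractions)
print(f"fitted diffusion length: {fitted_ld_nm:.1f} nm")
```

Real data would carry noise and a finite generation region, so a full analysis would fit a more detailed diffusion model than this single exponential.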

An electrically insulating PEG corona 
has a large energy gap and hence will not 
interact with an exciton in PDHF. By con- 
trast, QPT has a narrow energy gap and 
will actively accept excitons and quench 
photoluminescence from the PDHF core. 
Jin et al. extracted exciton diffusion lengths 
by systematically varying the PEG segment 
length, which in turn varies the number of 
excitons that reach the corona, and hence, 
the photoluminescence from QPT. This ele- 
gant photoluminescence quenching exper- 
iment is facilitated by the use of a scalable, 
seeded growth method, in which diblock 
copolymer segments of either PDHF-PEG 
or PDHF-QPT can be attached to form the 
subsequent nanofibers with different types 
of coronae (5). 

It is interesting to consider the origin 
of the large exciton diffusion lengths re- 
ported by Jin et al. Prior work has dem- 
onstrated the ability to realize improved 
exciton transport in crystalline materials 
by reducing energetic disorder and exciton 
trapping while also permitting a more op- 
timized orientation of molecular transition 
dipole moments for excitonic energy trans- 
fer (6). Micrometer-scale exciton diffusion 
lengths have been previously reported in
discrete single crystals, although transport
in these systems can also involve long-lived 
dark excitons (7, 8). The PDHF nanofibers 
reported by Jin et al. are highly crystalline, 
with favorable molecule orientation that 
leads to efficient Förster transfer. Long exciton
diffusion lengths are realized for luminescent
excitons without a contribution
from long-lived dark states. 

As researchers strive to exert greater 
control over energy transport in conju- 
gated systems, it is clear that progress is 
needed on multiple fronts. The work of Jin 
et al. reinforces the notion that crystalline 
order plays an important role in facilitat- 
ing exciton diffusion. More specifically, low 
disorder and strong π-orbital overlap can
enable more effective exciton transport. 
However, Jin et al. note that these fac- 
tors alone are insufficient to explain the 
reported long exciton diffusion lengths. 




Indeed, they highlight the possible role 
played by exciton delocalization and coher- 
ence. In order to realize paradigm-shifting 
improvements in exciton transport, it is 
essential to fully escape the limitations of 
the diffusive or subdiffusive transport re- 
gimes and realize ballistic transport (9). 
Whether through exciton delocalization 
and coherence, or through the design of 
architectures that remove the normal iso- 
tropy of energy transfer, the realization of 
ballistic transport in thin film would offer 
new frontiers of interesting science, as well 
as new device applications that extend beyond
the realm of photoconversion.

Cold chemistry with two atoms


Two atoms react to form a molecule in an optical “beaker” 


By Edvardas Narevicius 


For centuries, chemists have written
equations representing chemical reactions
by using symbols for atoms and
molecules; for example, 2H₂O + 2Na →
2NaOH + H₂. This short notation
shows only four reacting particles, but
even in a classroom demonstration where a
small piece of sodium is dropped in water,
the total number of reactants will be on the
order of Avogadro’s number (~6 × 10²³). On
page 900 of this issue, Liu et al. (1) instead 
study a chemical reaction taking place be- 
tween a minimal number of participants. In 
their experiment, exactly two atoms collide, 
absorb a photon, and form a molecule in 
the excited state. And this time, the reaction
equation, Na + Cs → NaCs* (where the
asterisk denotes an excited molecule), de- 
scribes exactly the process that takes place 
in the laboratory. 
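
To put the classroom-demonstration numbers in perspective, the following sketch counts the atoms in a small piece of sodium. The 1 g mass is an assumed, illustrative value; the constants are standard reference values.

```python
AVOGADRO = 6.02214076e23       # particles per mole (exact SI value)
MOLAR_MASS_NA = 22.98976928    # g/mol, standard atomic weight of sodium

def sodium_atoms(mass_grams):
    """Number of Na atoms in a sample of the given mass."""
    return mass_grams / MOLAR_MASS_NA * AVOGADRO

piece_mass_g = 1.0  # assumed size of the demonstration piece
atoms_in_piece = sodium_atoms(piece_mass_g)
print(f"{piece_mass_g} g of sodium contains about {atoms_in_piece:.1e} atoms")
```

Even a gram-scale piece holds more than 10²² atoms, compared with the two atoms in the experiment of Liu et al.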

During room-temperature liquid-phase 
reactions, local interactions and conditions 
vary widely, with many quantum states con- 
tributing to the reaction dynamics. Ensem- 
ble averaging can be helpful for describing 
these reactions, and thermodynamic prop- 
erties such as the Gibbs free energy can be 
used together with classical dynamics cal- 
culations to reliably predict reaction rates 
and outcomes. 

At another extreme are reactions between 
reactants in interstellar space, where particle 
densities are 10 orders of magnitude lower 
than in a liquid and where reactions hap- 
pen only when two reactants collide. Hersch- 
bach, Lee, and co-workers were the first to 
experimentally explore this regime with low- 
density cold molecular beams (2). Removing 
interactions with the environment allowed 
direct observation of reaction mechanisms. 

In the past decade, scientists have been 
able to observe reactions at ever-lower col- 
lision energies. At low collision energies, 
quantum-mechanical wavelike behavior 
plays a central role, necessitating use of 
quantum statistics (3) and consideration 
of quantum tunneling (4) and quantum 
scattering resonances (5-7). Liu et al. now 
measure inelastic collisions between only 
two trapped atoms, achieve formation of
an individual molecule, and perform molecular
spectroscopy.
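
One way to see why wavelike behavior matters at low collision energies is to estimate the thermal de Broglie wavelength, λ = h / sqrt(2π m k_B T). The sketch below evaluates it for a sodium atom at room temperature and at 10 microkelvin. Both temperatures are illustrative choices (the article itself quotes only "tens of microkelvin" for the laser-cooled atoms), and the helper name is hypothetical.

```python
import math

PLANCK = 6.62607015e-34            # J s
BOLTZMANN = 1.380649e-23           # J/K
ATOMIC_MASS_UNIT = 1.66053907e-27  # kg

def thermal_de_broglie_nm(mass_u, temperature_k):
    """Thermal de Broglie wavelength in nm: h / sqrt(2 pi m k_B T)."""
    mass_kg = mass_u * ATOMIC_MASS_UNIT
    wavelength_m = PLANCK / math.sqrt(
        2.0 * math.pi * mass_kg * BOLTZMANN * temperature_k)
    return wavelength_m * 1e9

NA_MASS_U = 22.99  # sodium atomic mass in atomic mass units

room_nm = thermal_de_broglie_nm(NA_MASS_U, 300.0)   # room temperature
cold_nm = thermal_de_broglie_nm(NA_MASS_U, 10e-6)   # assumed 10 microkelvin
print(f"300 K: {room_nm:.3f} nm, 10 uK: {cold_nm:.0f} nm")
```

At room temperature the wavelength is far smaller than a chemical bond, whereas at microkelvin temperatures it is on the order of 100 nm, much larger than the atoms themselves.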

It is surprising that the authors could 
construct a molecule by a reaction between 
two atoms. Usually, chemists increase den- 
sities until the collision frequency is high 
enough for products to be formed. Liu et 
al. instead use a clever technique to confine 
the two atoms in a small (several cubic mi- 
crometers) trap that is sufficiently deep to 
keep atoms with a temperature of 1 millikel- 
vin (mK). They create the trap with tightly 
focused laser beams that confine particles 
in the region with the highest laser field 
intensity. Similar to laboratory tweezers, at- 
oms trapped inside can be moved around by 
translating the laser beams in space. 

To be able to place the atoms in optical 
tweezers, the authors first had to lower the ki- 
netic temperature down to tens of microkel- 
vin. To achieve this temperature, the authors 
used one of the most successful tools of mod- 
ern atomic and optical physics, laser cooling 
(8). They laser-cooled Na and Cs atoms in 
individual optical tweezer traps and detected 
them using a fluorescence signal, leading to a 
~33% success rate in the preparation of two 
reactants in a well-defined initial state. 

After cooling the atoms and trapping 
them in individual laser tweezers, the au- 
thors adiabatically merged the two traps 
and formed an optical “beaker” containing 
both Na and Cs. The next step was to show 
and prove that collisions between the two 
atoms are detectable. To do so, the authors 
prepared a beaker containing two atoms in 
which excited hyperfine states were popu- 
lated. Hyperfine states form through inter- 
actions between electron and nuclear spins 
within an atom, with excitation energies on 
the order of 100 mK. During a collision, the 
internal excitation energy can be released 
in the form of kinetic energy, and atoms 
will be ejected out of the shallow optical 
trap. The atoms are lost after several mil- 
liseconds, indicating a high collision rate. 
The inelastic collisions can be strongly sup- 
pressed by controlling the initial quantum 
state of the atoms. If atoms are driven to 
the lowest energy state, their lifetime in the 
trap increases to several seconds. 
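
The energy scales in this paragraph can be compared by expressing hyperfine splittings in temperature units, T = hν / k_B. The ground-state hyperfine frequencies used below are standard literature values for ²³Na and ¹³³Cs, not numbers quoted in the article; the roughly 1 mK trap depth is the figure mentioned earlier in the text.

```python
PLANCK = 6.62607015e-34   # J s
BOLTZMANN = 1.380649e-23  # J/K

# Ground-state hyperfine splittings in GHz (literature values,
# supplied here as an assumption; not given in the article).
HYPERFINE_GHZ = {"Na": 1.7716, "Cs": 9.1926}

TRAP_DEPTH_MK = 1.0  # approximate trap depth quoted in the text

def frequency_to_millikelvin(freq_ghz):
    """Convert a transition frequency to an equivalent temperature in mK."""
    return PLANCK * freq_ghz * 1e9 / BOLTZMANN * 1e3

hyperfine_mk = {atom: frequency_to_millikelvin(f)
                for atom, f in HYPERFINE_GHZ.items()}

for atom, t_mk in hyperfine_mk.items():
    print(f"{atom}: {t_mk:.0f} mK, about {t_mk / TRAP_DEPTH_MK:.0f}x the trap depth")
```

Because the released energy exceeds the trap depth by roughly two orders of magnitude, a single hyperfine-changing collision is enough to eject the atoms.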

The long trapping lifetimes enabled 
the authors to perform the impressive feat 
of building a single molecule out of two 
atoms. In principle, a collision between 
two ground-state atoms cannot produce a
bound molecule. A third body is needed to
carry away the excess bonding energy. Usually,
interaction with another atom or molecule
serves this purpose. In Liu et al.’s study,
photons take care of energy conservation. 
Two photons are needed to create a mol- 
ecule out of two separate atoms (see the 
figure). The first photon that starts the re- 
action is absorbed by an atom. However, 
the photon energy is tuned just below the 
atomic transition and can only be absorbed 
if a molecular state is reached (see the fig- 
ure). By scanning the laser frequency, the 
authors could detect several bound mo- 
lecular levels on the electronically excited 
potential and assign them to particular mo- 
lecular vibrations, demonstrating that mo- 
lecular spectra can be measured with only 
two atoms. The electronically excited molecule
quickly emits a photon, leaving a molecule
in a vibrationally excited level. Formation
of a molecule is detected as a simultane- 
ous loss of fluorescence of the Na and Cs 
atoms. Here again, the authors harness the 
low temperatures that can be achieved with 
laser cooling. The transition probability between
the free atoms and a bound molecular state
depends on the overlap between the corresponding
wave functions. To form a molecule, two
atoms have to spend sufficient time at distances
on the order of the length of a molecular
bond. This time can be increased by
reducing the relative kinetic energies to values that
correspond to temperatures in the range
of several microkelvin. This leads to higher
excitation probabilities that do not require 
high-intensity laser fields (9). 

Liu et al.’s method opens many inter- 
esting avenues for research. The authors 
plan to demonstrate that with the help of 
an additional laser field, molecules can 
be coherently transferred into a single se- 
lected vibrational level. Such a degree of 
quantum state control would allow studies 
of quantum state-resolved collisions and 
reactions between molecules. The ability 
to work with well-defined numbers of at- 
oms will provide unprecedented precision 
in unraveling the microscopic mechanisms 
of complex interactions. Another interest- 
ing application for “designer” cold mol- 
ecules would be the assembly of a lattice of 
strongly interacting polar molecules that 
could serve as qubits or as a tool to investigate
unusual quantum phases.

[Figure: Making exactly one molecule. By confining one Cs atom and one Na atom in
separate molecular tweezers and then merging the tweezers to form an optical beaker,
Liu et al. achieve the formation of an individual molecule in a vibrationally
excited state. Label: photoassociation photon.
Panel: Photoassociation process. Free Na and Cs atoms collide, absorb a photon, and form an electronically excited molecule, which emits another
photon and forms a stable molecule.]

SYNTHETIC BIOLOGY


Illuminating 
dark depths 


Microelectronic processing 
and engineered bacteria 
provide real-time insights 
into the gut 


By Peter R. Gibson and Rebecca E. Burgell 


The inner workings of human physiology
remain one of the final frontiers
of science. In particular, the gastro- 
intestinal (GI) microenvironment is 
incompletely understood. On page 
915 of this issue, Mimee et al. (1) have
developed an ultralow-power ingestible sen- 
sor of various compounds (ingestible micro- 
bio-electronic device, IMBED) by combining 
engineered bioluminescent bacteria and mi- 
croelectronic processing. This has the poten- 
tial to unlock a wealth of information about 
the body’s structure and function, its rela- 
tionship with the environment, and the im- 
pact of disease and therapeutic interventions. 
Tiny sensors can now be made through 
advances in semiconductor microelectron- 
ics and microfabrication, and their signals 
can be computed and communicated wire- 
lessly to remote receivers (2). Furthermore, 
the increasingly low power demands of such 
devices, developments in battery technology, 
and advances in wireless signal transmission 
have permitted deeper, more durable explo- 
ration within the body (2). Such technology 
can be used to study the luminal content of 
the GI tract, as understanding its physiol- 
ogy and pathology is limited, largely owing 
to difficulties in access. Although devices can 
be inserted into regions close to the exte- 
rior, few provide measurements under truly 
physiological conditions. For example, access 
to the colon generally involves bowel cleans- 
ing, which disrupts normal physiology. Inac- 
cessibility almost precludes such approaches 
to most of the small intestine. This negatively 
affects our comprehension of inflammatory 
intestinal disorders such as Crohn’s disease, 
celiac disease, and irritable bowel syndrome, 
which are prevalent worldwide (3, 4). 
The design of biosensors in the form of 
capsules, no greater in size than a large vitamin
pill, has revolutionized the ability
to access the GI tract without the need for
physiological disruption (5). The addition of a 
miniaturized camera on such devices has en- 
abled endoscopic visualization of the entire 
GI tract and opened the small intestine to 
medical scrutiny and scientific research (2, 5). 
Such devices are now in routine use world- 
wide for diagnosing inflammatory, ulcerative, 
and proliferative lesions and identifying sites 
and causes of bleeding (6). 

With their success has come the demand 
for new investigative technology that not 
only provides an assessment of bowel struc- 
ture but that also expands our understand- 
ing of normal GI physiology and the function 
of nutrients and microbiota in health and 
disease. Microsensors have been developed 
to map temperature, pH, and gas concen- 
trations as the capsule moves down the gut 
(7-10). By utilizing a semipermeable mem- 
brane, gaseous small molecules are sampled 
by microsensors in the capsule, and their 
concentrations are transmitted in real time 
to an external wireless receiver and recording 
device. For example, the observed concentra- 
tions of hydrogen, carbon dioxide, and meth- 
ane give unparalleled insights into the inner 
workings of the microbiota of the intestine 
(8, 9). GI microbiota are key for gut health 
and play critical, although incompletely understood,
roles in overall health (11).

Mimee et al. offer a system for detecting 
biomolecules in the GI tract. A probiotic 
bacterium (12) was genetically engineered to 
detect and metabolize heme (a component 
of red blood cells) and to convert heme me- 
tabolites into a luminescent signal. A capsule 
enclosed the bacteria within a semiperme- 
able membrane, and electronic photodetec- 
tors and transmission platforms detected the 
luminescent signal. After capsule ingestion, 
this information was transmitted wirelessly 
in real time to an external receiver and re- 
corder, such as a smartphone (see the figure). 
When tested in vivo in pigs, the ingested cap- 
sule detected heme (that is, lysed red blood 
cells) with high specificity. 

The current application of this prototype 
has limited utility in clinical medicine: Bleed- 
ing from the stomach is readily detected and 
treated in clinical practice (13). However, the
IMBED prototype provides proof of concept 
that a specific molecule can be detected in 
vivo and this information wirelessly transmitted
outside the body. Indeed, Mimee et al.
show that this device can detect other com- 
pounds within the gut, including a biomarker 
of inflammation and specific bacteria. 

The future challenge is to find an appro- 
priate application for the technology. The 
approach of Mimee et al. was to examine a 
molecule (heme) not normally found in the 
GI lumen. A similar situation might apply 
to molecules associated with cancers and
inflammation, as well as infection (such as
a specific microbe). Assessing multiple mol- 
ecules simultaneously might also be advan- 
tageous, although technical issues such as 
whether the bioluminescent system can be 
adapted to different wavelengths need to be 
addressed. There is also potential for robotic 
capabilities to be included in such devices, 
enabling, for example, drug release when the 
analyte is detected (2). 

However, there are limitations. In the
case of the GI tract, sometimes the greatest
barrier is that we do not know what we
do not know. As such, the clinical utility of
the device is likely to be limited by the rate
at which biomarkers are characterized. Furthermore,
for most molecules, it is not their
presence but rather where and in what concentration
they are found that will be key to
understanding GI physiology. The microenvironment
of each segment of the GI tract
varies, and thus a device that responds to a
change in analyte concentration is crucial.
Whether the IMBED system is able to respond
to change over time is uncertain; it
is not clear whether the bioluminescence
signal can be reset once activated.

Critical to next-generation devices will
be location determination. For optical imaging
of the gut, anatomical landmarks can
be used (5, 6). However, as inclusion of a
camera results in excessive power needs,
nonoptical means are necessary. Changes in
pH appear to be reliable at estimating passage
through different GI regions (14), and
a drop in temperature indicates capsule excretion
(7). The development and validation
of a gas-sensing capsule (8, 9) alternatively
provides the theoretical use of other molecules
for localization. Whether such a biosensor
can be successfully incorporated into
the IMBED requires further study.

Despite these limitations, it is exciting
to watch where this technology ultimately
takes us. Although the GI tract is ideal for
this technology, its application to other
parts of the body, such as solid organs or
the circulation, is also theoretically possible.
The former would require implantation
and removal. The latter would necessitate
further miniaturization and the application
of different structural components, such
as carbon nanotubes, to provide flexibility,
strength, and safety (15).

The IMBED system described by Mimee et
al. provides another step in delving into the
dark depths of the GI tract and could lead to
real-time identification, detection, and quantification
of biomolecules in humans, with
great potential benefit to clinical medicine
and research.

[Figure: The next generation of biosensors. The IMBED prototype is proof of principle of the potential for biosensors in capsules
to detect molecules in the gut, which could expand our understanding of normal
GI physiology and the function of nutrients and microbiota in health and disease.
After capsule ingestion, information is wirelessly transmitted in real time to an external receiver
and recorder, such as a smartphone.]

CANCER MICROBIOME


Cancer immunity thwarted by the microbiome 


Microbial bile acid metabolites promote liver metastasis 


By Nadine Hartmann and
Mitchell Kronenberg


Liver cancer rates have tripled since
1980, making it one of the leading
causes of cancer deaths worldwide
(1). The efficacy of a variety of cancer
treatments, as well as carcinogenesis
itself, can be influenced by
the microbiome (2). Microbiome-cancer 
interactions are complex, as the microbi- 
ome can promote chronic inflammation 
or directly affect cancer cells. Further- 
more, by influencing the immune system, 
the microbiome may either promote or 
inhibit antitumor immune responses (3). 
On page 876 of this issue, Ma et al. (4) 
show that a type of commensal bacteria, 
Clostridium species, prevents an effective 
immune response to both primary liver tu- 
mors and liver metastases by a previously 
unknown mechanism: not by means of its 
own metabolites, but by modifying a host 
product—the bile acids that liver cells are 
exposed to. 

About 75% of the blood that passes 
through the liver is supplied from the in- 
testine by the portal vein, thus constantly 
exposing the liver to components and me- 
tabolites from the intestinal microbiota. 
To avoid excessive immune stimulation, 
at steady state, hepatic tissue promotes 
immune tolerance rather than immune 
activation (5). The liver has a mostly non- 
circulating immune system consisting of T 
lymphocytes, B lymphocytes, and macro- 
phages, among others (6). In the absence 
of infection or injury, these immune cells 
are predominantly in the sinusoidal blood 
vessels rather than the tissue parenchyma. 
The liver-resident lymphocyte populations 
have different properties from their circu- 
lating counterparts. For example, natural 
killer T (NKT) cells constitute up to 40% 
of the total intrahepatic T lymphocytes in 
mice, but they are much less frequent in 
other sites. 

Although most T lymphocytes recognize 
peptides bound to or presented by anti- 
gen-presenting molecules encoded by the 
highly polymorphic major histocompatibility
complex (MHC) genes, NKT cells recognize
lipids presented by the monomorphic
MHC class I-like protein CD1d (7). NKT
cells are distinguished by their rapid and 
intense cytokine responses: Intrahepatic 
NKT cells produce copious amounts of interferon-γ
(IFN-γ) when activated, which
has been associated with their ability to 
respond to tumors (8). 

To understand the role of the microbi- 
ome in hepatic cancers, Ma et al. analyzed 
primary liver tumors in transgenic mice as 
well as liver metastases from lymphoid or 
melanoma tumor cell lines engrafted out- 
side the liver. All mice received a cocktail 
of three antibiotics to minimize the mi- 
crobiome. Antibiotic treatment reduced 
tumor growth or liver metastases, but not 
metastases to other sites, such as the lungs. 
This tissue-specific effect correlated with an 
increase in activated liver NKT cells. Fur- 
thermore, tumor-bearing CD1d-deficient 
mice were unaffected by antibiotic treatment.
These mice lack not only NKT cells
but also other CD1d-reactive cells in the
liver. The accumulation of NKT cells in the
liver requires expression of the chemokine
receptor CXCR6 (C-X-C chemokine receptor
type 6), which binds the ligand CXCL16
expressed on the surface of liver sinusoidal
endothelial cells (LSECs) (see the figure).
Antibiotic treatment increased LSEC expression
of CXCL16 by an unknown mechanism.
In CXCR6-deficient mice, antibiotic
treatment did not augment tumor growth,
suggesting that the increased antitumor response
following antibiotics was due to enhanced
NKT cell accumulation in the liver
mediated by CXCL16-CXCR6 interactions.

[Figure: Primary bile acids promote liver tumor immunity. Clostridium species modify bile acids, which prevents
antitumor immune responses in the liver.]

How does the gut microbiome signal to 
LSECs? Bile acids—which are synthesized 
in the liver, stored in the gall bladder, and 
secreted into the intestine—are involved in 
dietary fat absorption. Recently, bile acids 
have been found to shape the intestinal 
microbiome composition, but microbiome 
metabolism also affects bile acid profiles 
(9). Primary bile acids that are synthesized 
in the liver stimulate hepatocytes to pro- 
duce proinflammatory cytokines and che- 
mokines like CXCL16, thus contributing to 
the recruitment of CXCR6⁺ lymphocytes to
the liver (10). However, primary bile acids 
are metabolized to secondary bile acids by 
intestinal bacteria and are returned to the 
liver by the enterohepatic circulation (9). 
Ma et al. showed that LSECs up-regulate 
Cxcl16 messenger RNA (mRNA) produc- 
tion when exposed to primary bile acids, 
whereas Cxcl16 mRNA production was de- 
creased upon secondary bile acid exposure. 
Also, treatment with antibiotics depleted 
the main producers of secondary bile acids 
and increased CXCL16 expression in LSECs.
Importantly, increasing primary bile acids 
(for example, through feeding) increased 
intrahepatic NKT cells and reduced tumor 
metastases. Because the selective use of an- 
tibiotics implicated Gram-positive bacteria, 
and because Clostridium species carry out a 
major enzymatic step in secondary bile acid 
formation, Ma et al. determined that coloni- 
zation with a commensal Clostridium spe- 
cies led not only to decreased intrahepatic 
NKT cells but also increased liver tumor 
metastases. Interestingly, it has been shown 
that a high-fat diet-induced increase of in- 
testinal Clostridium species increases levels 
of secondary bile acids in the liver, which in 
turn promotes liver cancer (11).

Why is a specialized T cell subset critical
for immune protection from liver metas- 
tases? Although NKT cells recognize lipids 
containing fatty acids, bile acids have an en- 
tirely different structure and are not known 
to be antigens. Therefore, the role of pri- 
mary bile acids in NKT cell activation is in- 
direct, by promoting NKT cell accumulation 
but not their activation. Ma et al. analyzed 
CD1d-expressing tumors, and they suggest
that CD1d-presented lipid antigens from the
tumors could be directly recognized by NKT
cells. It is also possible that NKT cells could
be activated by lipid antigens from CD1d⁻
tumors presented by CD1d⁺ macrophages
or dendritic cells or even by inflammatory
cytokines such as interleukin-12 (IL-12) (12).
Regardless, the prevalence of NKT cells in 
mouse liver and their strong effector func- 
tion allow them to be a dominant player in 
the anticancer response of this organ. The 
situation is different in humans, as intra- 
hepatic NKT cells are much less frequent. 
However, mucosal-associated invariant T 
(MAIT) cells, which recognize microbial 
vitamin B metabolites presented by MHC-related
protein 1 (MR1), are prevalent in
human liver, and they also exert rapid effector
functions, such as IFN-γ secretion
(7). Interestingly, MAIT cells also express 
CXCR6, and, although their antitumor po- 
tential is uncertain, they do infiltrate some 
metastatic lesions in the liver of colorectal 
carcinoma patients (13).

Although there are bile acid profiles that 
are associated with cancer, several studies 
advocate modifying bile acid production 
to improve cancer therapy, for example, 
through agonists of bile acid receptors or 
simply by changes in diet (14). Thus, as for
most biological systems, bile function has 
different facets. It is designed to emulsify 
fats and help absorb dietary lipids, but after 
modification by the microbiome, secondary 
bile acids alter immune function to promote
liver cancer and liver metastases.

The RNA face


of phase separation 


RNA regulates the formation, identity, 
and localization of phase-separated granules 


By Magdalini Polymenidou 


Phase separation or liquid unmixing, a
phenomenon resembling the formation
of oil droplets in vinegar, has
emerged as a major driver of functional
compartmentalization within
cells, allowing the rapid and dynamic
isolation of specific activities from the surrounding
cellular environment, without the
need of a membrane (1). A flurry of exciting
recent studies demonstrated the importance 
of RNA binding proteins (RBPs) with low 
complexity regions (LCRs) in this process [for 
example, (2, 3)], yet the role of incorporated 
RNAs is less well understood. On pages 918 
and 922 of this issue, Maharana et al. (4) and 
Langdon et al. (5) propose that cellular RNAs 
regulate the formation, subcellular localiza- 
tion, and identity of these granules, which 
has implications for stress responses and 
pathologic protein aggregation in neurode- 
generative diseases. 

Cellular stressors induce the formation of 
perhaps the best studied of these so-called 
membraneless organelles, stress granules 
(SGs), which consist of multiple RBPs in 
complex with cytoplasmic RNAs. SGs stall 
the translation of incorporated RNAs and 
contain RBPs with LCRs (also called prion-like
domains), which can phase-separate,
both in vitro and within cells (1-3). Several of 
these SG-incorporating prion-like RBPs form 
pathologic protein aggregates in affected 
neurons of patients with neurodegenerative 
diseases. The same proteins form phase- 
separated liquid droplets in vitro, which are 
dynamic in a similar manner to SGs in cells. 
These liquid droplets transform over time
into solid-like structures (1-3), somewhat resembling
the pathologic protein aggregates
in patients with neurodegeneration. These 
observations have reinforced the hypothesis 
that SGs act as precursors of pathologic ag- 
gregates (6, 7). How this transition occurs in 
the degenerating brain remains unclear, and 
a direct in vivo demonstration is still missing. 

A main driver of protein phase separation 
is concentration, so Maharana et al. asked 
whether the intrinsic cellular concentrations
of RBPs known to form membraneless or- 
ganelles could explain this behavior. Surpris- 
ingly, they found that in the absence of any 
other components, several RBPs implicated 
in neurodegeneration phase-separate in vi- 
tro in concentrations similar to those physi- 
ologically found in the nucleus. Yet inside 
the cell, the same proteins remain diffused 
in the nucleus and are typically excluded 
from phase-separated compartments, even 
under cellular stress conditions, which drive 
their condensation into cytoplasmic SGs. 
Searching for which nuclear component may 
prevent the phase separation of RBPs, they 
found that high RNA:protein ratios (like
those typically found in the nucleus) prevent
phase separation, whereas low ratios,
similar to those in the cytoplasm, promote 
it (see the figure). This effect seems to be 
independent of sequence specificity for RBP 
binding. How these RNAs keep RBPs from 
phase-separating in the absence of specific 
binding and what the role of LCRs is in this 
process remain unresolved. Interestingly, 
Maharana et al. show that in vitro high 
amounts of short, nonspecific RNAs keep 
prion-like RBPs soluble. Conversely, longer 
RNAs that can form higher-order second- 
ary structures and which specifically bind 
RBPs can nucleate phase separation. This 
is potentially the reason that Neat1 (nuclear
paraspeckle assembly transcript 1), a long 
noncoding RNA with specific FUS (fused in 
sarcoma) binding sites (8), can capture FUS 
out of the diffused nuclear pool to form para- 
speckles (phase-separated nuclear compart- 
ments that regulate gene expression). 
Maharana et al. propose that these obser- 
vations may explain the somewhat prefer- 
ential phase separation of nuclear proteins 
into cytoplasmic SGs, which in turn may 
transform into aggregates in the degenerat- 
ing brain. Cytoplasmic aggregates are toxic 
to cells because they interfere with major cellular
functions, including nucleocytoplasmic
transport (9), so promoting their removal is a 
focus of drug development for neurodegener- 
ative diseases. Because many of the key pro- 
tein components of cytoplasmic aggregates 
play essential cellular functions, therapeutic 
approaches aimed at their transcriptional 
silencing are not suitable. Instead, regulat- 
ing their transition to pathologic aggregates 
might be a viable strategy, and a recent study 
showed that interfering with the kinetics of 
SG formation may reverse neurodegenerative 
processes associated with TDP-43 (TAR-DNA-binding
protein of 43 kDa) aggregation (10).
The observation that small nonspecific RNAs
solubilize FUS and prevent its phase separation
both in vitro and in cells (4) now opens
up possibilities for exploring RNA mimetics
as an alternative strategy to counteract the
pathologic solidification of RBPs.

The role of RNA secondary structure and
RBP binding specificity in defining the identity
and subcellular localization of phase-separated
granules is the focus of the Langdon
et al. study. They investigated two distinct
RNA granules of yeast cells containing the
Whi3 protein, the phase separation of which
was shown to control cellular memory. Whi3
protein phase-separates via its RNA binding
motif (RRM) and its polyglutamine (polyQ)
tract and forms granules localized either
near the nucleus or toward cellular growth
tips. These two types of granules contain
different RNA species and levels of Whi3
protein (5, 11). Langdon et al. show that RNA
secondary structure determines their
interaction with other RNAs through exposure or
masking of complementary sequences, forming
double-stranded RNA, as well as specific
Whi3 RNA binding sites. Importantly, the
two types of Whi3-containing granules do
not mix, either in vitro or in cells, supporting
their distinct structures. These findings
suggest that RNA secondary structure delineates
the complex composition of granules
through the formation of specific RNA-RNA
and RNA-RBP interactions.

RNA controls phase separation
High nuclear RNA concentrations prevent phase separation of RBPs in a nonspecific manner. Long RNAs
with stable secondary structure can capture RBPs out of solution, leading to phase-separated nuclear
compartments such as paraspeckles and cytosolic granules. The exposure of specific RBP binding sites and
complementarity stretches within RNA secondary structures regulates
the recruitment of other RNAs and RBPs within granules.

Both studies point to RNA secondary structure
(of Neat1 or Whi3-interacting RNAs) as
a major navigator of the formation, identity,
and composition of phase-separated granules,
and both groups speculated that these
principles are likely to be generally applica- 
ble for other granules, including cytoplasmic 
SGs. In line with this, RNA-RNA interactions 
were recently shown to contribute to the for- 
mation of SGs and define their overall RNA 
composition (12). Moreover, the protein com- 
position of SGs varies among different cell 
types, with neurons demonstrating the highest
complexity (13). In this RNA-centric view
of phase separation, a key question now is 
whether the specific cytoplasmic RNA pool 
of each cell type drives the disparity in the 
protein composition of SGs. 

Another related outstanding question is 
what defines the specificity of protein components
in pathologic aggregates. Even
though many different proteins are local- 
ized to physiologic SGs, only a few of them 
persist in pathologic aggregates. Moreover, 
different RBPs comprise the pathologic ag- 
gregates in specific disease subtypes, even 
in the absence of disease-causing muta- 
tions. For example, TDP-43 is the main 
aggregated protein in a large subset of 
patients with frontotemporal dementia 
(FTD), whereas FUS seems undisturbed in 
the same neurons. Conversely, FUS forms 
cytoplasmic aggregates in a distinct group 
of FTD patients that do not show TDP-43 
aggregates. In view of the studies from Ma- 
harana et al. and Langdon et al., the distinct 
RNA binding specificities of TDP-43 (14) and
FUS (8) may account for their preferential 
incorporation into diverse granules before 
their conversion into pathologic aggregates. 
Which specific RNAs these are and whether
they remain trapped within the pathologic 
assemblies are key questions. 

It will be important to further dissect the 
role of RNA in the formation and identity of 
different phase-separated RBPs and to un- 
derstand the rules that regulate their differ- 
ential composition within a cell and among 
different cell types. Recently, long RNAs as- 
sociated with repeat expansion disorders, 
including Huntington’s disease, muscular 
dystrophy, and amyotrophic lateral sclerosis, 
were shown to phase-separate independently 
of RBPs (15). In view of the studies by Maharana
et al. and Langdon et al., the accumulation
of foci containing repetitive RNA in
cells from patients with neurodegeneration 
may trigger a broad imbalance in the overall 
RBP:RNA ratios and interactions and a more 
general misregulation of phase separation in 
nuclear and cytoplasmic compartments than 
previously understood. 

Disparities in science literacy


Cognitive and socioeconomic factors don’t fully explain gaps 


By Nick Allum, John Besley,
Louis Gomez, Ian Brunton-Smith


Much is known about how adult science
literacy varies internationally
and over time, and about its association
with attitudes and beliefs. However,
less is known about disparities
in science literacy across racial and
ethnic groups (1). This is particularly surprising
in light of substantial research on racial
and ethnic disparities in related areas such as 
educational achievement, math and reading 
ability (2), representation in science, technol- 
ogy, engineering, and math (STEM) occupa- 
tions (3), and health literacy (4). Given the 
importance of science literacy to securing 
and sustaining many jobs, to understand- 
ing key health concepts to enhance quality 
of life, and to increasing public engagement 
in societal decision-making (5), it is concern- 
ing if the distribution of science literacy is 
unequally stratified, particularly if this strati- 
fication reflects broader patterns of disadvan- 
tage and cultural dominance as experienced 
by minorities and educationally underserved 
populations. We describe here such dispari- 
ties in science literacy in the United States 
and attempt to explain underlying drivers, 
concluding that the science literacy disadvan- 
tage among black and Hispanic adults rela- 
tive to whites is only partially explained by 
measures of broader, foundational literacies 
and socioeconomic status (SES). 

The main source of evidence about U.S. 
patterns and trends in science literacy is the 
National Science Board’s Science and Engi- 
neering Indicators (SEI) survey module (3), 
administered biennially since 2006 to a sub- 
sample of respondents for the General Social 
Survey (GSS), a high-quality biennial survey 
that seeks to provide a representative picture 
of American adults (aged 18 or older). Science 
literacy is captured by questions covering 
basic scientific facts and processes, but subgroup
analysis is only presented in SEI for
gender, age, education, and income because 
sample sizes do not permit more granular 
analysis. Research using different questions 
(6) found that white Americans score more 
highly than blacks and Hispanics, although 
sample sizes for black and Hispanic groups 
were relatively low and only bivariate analy- 
sis was presented. Other work (7) found that 
black Americans reported lower confidence 
in science, even after adjusting for attitudinal 
and demographic factors, though this study 
did not look at science literacy. 

In contrast to the sparse research on race
and adult science literacy, there is voluminous
evidence of racial inequalities in educational
measures of children’s science knowledge (8).
Moving beyond narrow science literacy to
health literacy and foundational reading literacy,
we see similar stratified patterns where white
Americans do better than blacks and Latinos,
with [...] across groups (9).

In the present study, our first objective is to
estimate racial and ethnic disparities in science
literacy among adults in the United States.
We estimate these by combining data from six waves
of the GSS between 2006 and 2016 (n = 2339).
We take it as axiomatic that the explanation
for such disparities must be found in socially
determined factors that fall differentially
on different groups. Our second objective
is therefore to investigate plausible factors,
including demographics, foundational literacy,
attitudes, and access to information
that could account for such disparities [see
supplementary materials (SM) for details on
all data and analyses].

The GSS science survey module includes
multiple choice (mostly true/false) questions
about science content and process, along
with open-ended questions. We regard these
questions as indicators of the broader construct
of science literacy. To measure foundational
literacy, we rely on a well-established
measure of verbal ability, Wordsum, that has
been included in the GSS since 1974 (10). We
use a standard set of demographic controls,
including gender, birth cohort, geographical
region, education, income, and religion. We
also employ the following covariates that we
hypothesize could account for between-group
literacy differences.

We use a particularly rich measure of SES,
the Cambridge Social Interaction and Stratification
(CAMSIS) scale (11), that represents
differences in status, prestige, and economic
advantage, based on respondents’ occupational
groups. This measure is useful as it
reflects the kinds of personal networks, social
class, and cultural milieu, in which views
about science develop and which may overlap
with racial inequality. Further, we might surmise
that a largely white teaching force has
often failed to understand contexts of social
life and interests of black and Hispanic students,
to connect those interests to scientific
phenomena, and to support scientific literacy
about the phenomena. This neglect may lead
to different levels of science literacy even
given equivalent formal qualifications (12).

Some minority groups express less trust
and confidence in science compared to
whites (7). Low confidence in science arguably
could lead to lack of science engagement
in various settings, thus lower knowledge
scores. We therefore include measures that
ask people how much “confidence” they have
in the “scientific community” and how positive
they are about science.

One potential benefit of the internet may be in
reducing the gap between the information-rich
and information-poor. However, knowledgeable
individuals are often able to acquire information
more effectively, so the internet may exacerbate
knowledge gaps (13).
We include a question on whether respondents
have sought science information on
the internet.

Pooling the samples across all years yields 
a mean science literacy quiz score for whites 
of 8.6 (out of a maximum possible 13), His- 
panics 6.8, and blacks 6.5. A one-way analy- 
sis of variance shows statistically significant 
differences between groups (F = 283; df = 3). 
The overall mean for all groups combined is 
8.0 with a standard deviation of 2.7; the av- 
erage difference between whites and the two 
racial and ethnic groups is quite substantial, 
at around two-thirds of a standard deviation. 
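
As a quick check on the size of these gaps, the reported group means can be expressed in units of the overall standard deviation. The sketch below uses only the figures quoted above; the variable and function names are illustrative assumptions and do not come from the study.

```python
# Group means (out of 13) and overall standard deviation as quoted in the text.
means = {"white": 8.6, "hispanic": 6.8, "black": 6.5}
overall_sd = 2.7


def gap_in_sd(group, reference="white"):
    """Difference between the reference mean and a group mean, in SD units."""
    return (means[reference] - means[group]) / overall_sd


# Hypothetical helper names; the study reports only the rounded summary.
gaps = {g: gap_in_sd(g) for g in ("hispanic", "black")}
average_gap = sum(gaps.values()) / len(gaps)

for name, value in gaps.items():
    print(f"{name}: {value:.2f} SD below the white mean")
print(f"average gap: {average_gap:.2f} SD")
```

The Hispanic gap works out to about 0.67 and the black gap to about 0.78 standard deviations, so the average of roughly 0.72 is in line with the "around two-thirds" summary.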

We tried to gain better understanding of 
these disparities by adjusting for potential 
confounding factors. We fit several multivari- 
ate ordinary least squares regression models 
with science knowledge as the dependent 
variable. The first model included indicators 
for race and ethnicity alongside demographic
variables: sex, birth cohort, region of resi- 
dence, college education, and income. We also 
include SES and number of science courses 
taken at high school or college level. If science 
literacy disparities are due to these factors, 
then statistically adjusting for them should 
attenuate or remove any residual differences 
among the groups. In the second model, we 
added our measure of foundational literacy. 
If this eliminates race or ethnicity differ- 
ences, it would support the hypothesis that 
broader literacy disparities lie behind science 
literacy disparities. In the third model, we 
added attitude toward science, confidence 
in science, and science internet use, to see if 
these more proximal features of orientation 
to science and technology might lie behind 
science literacy differences. 
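
The logic of these nested models can be illustrated with a minimal sketch on synthetic data. Everything here is an assumption made for illustration: the data are simulated, not GSS responses, the coefficient values are arbitrary, and the `ols` helper is a plain normal-equations solver rather than the authors' software. The point is only to show how a group coefficient attenuates when a covariate that differs between groups is added.

```python
import random


def ols(rows, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * yi for r, yi in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda idx: abs(a[idx][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for row in range(k):
            if row != col:
                factor = a[row][col] / a[col][col]
                a[row] = [x - factor * p for x, p in zip(a[row], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]


# Synthetic data (assumed values): a group indicator, a verbal-ability score
# that is lower on average in the group, and a science score that depends on both.
rng = random.Random(0)
n = 2000
group = [1.0 if rng.random() < 0.3 else 0.0 for _ in range(n)]
verbal = [6.0 - 1.0 * g + rng.gauss(0, 2) for g in group]
score = [5.0 + 0.4 * v - 1.0 * g + rng.gauss(0, 2) for g, v in zip(group, verbal)]

# Model 1: intercept + group. Model 2: model 1 + verbal ability.
model1 = ols([[1.0, g] for g in group], score)
model2 = ols([[1.0, g, v] for g, v in zip(group, verbal)], score)
gap_model1 = model1[1]
gap_model2 = model2[1]

print(f"group gap without verbal ability: {gap_model1:.2f}")
print(f"group gap adjusting for verbal ability: {gap_model2:.2f}")
```

In this simulation the adjusted gap shrinks toward the direct effect built into the data but does not vanish, which mirrors the pattern the nested models are designed to detect.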

Even after adjusting for demographic 
variables, science knowledge disparities are 
only partially reduced. There is still around 
a 1.5-point difference in the average scores of 
black and white Americans [see the figure, 
model 1 (blue)]. The gap for Hispanics is nar- 
rower, but their mean is still about a point 
lower than that for whites. Most of the other 
included predictors are significantly associ- 
ated with science literacy and consistent with 
extant research (fig. S1 and table S1). Model 
1 accounts for about 20% of the variance in 
science literacy. 

Adding foundational literacy to the model 
shrinks the coefficients for both minority 
groups. Residual gaps are just over one point 
for blacks and a half point for Hispanics [see 
the figure, model 2 (red)]. This model ac- 
counts for just under 22% of the variance. 
The hypothesis that disparities in founda- 
tional literacy account for gaps in science 
literacy receives some support, but there is 
much left to explain. Adding behavioral and 
psychological variables increases the amount 
of variance explained by the model to 30%, 
but we see little change in the race and eth- 
nicity coefficients [see the figure, model 3 
(green)]. Race and ethnicity continue to mat- 
ter even when comparing the science literacy 
of people with similar science attitudes. 

Overall, disparities in science literacy 
cannot be straightforwardly “explained” by 
intergroup differences in the levels of our 
measured characteristics. We performed 
a decomposition analysis and found that 
whereas around one-third of the variation 
in knowledge scores is explained by the in- 
dependent variables, only about half of the 
total race and ethnicity gaps are explained by 
these observables. For both blacks and His- 
panics, differences in foundational literacy 
compared to whites are the most important 
of the observable influences on the size of the 
disparity. The remaining, unexplained, por- 
tion of the gap must in large part be driven 
by variables not in our models. Translating
confidence in, and positive attitudes toward,
science into higher science literacy appears
to be less common for blacks and Hispanics
than for whites, for unclear reasons.

Change in science knowledge score per unit change in independent variables
Unstandardized regression coefficients. Model 1 = demographic variables;
model 2 = model 1 + foundational literacy; model 3 = model 2 +
behavioral/psychological variables. Effect of black/Hispanic/other
is compared to white. Effect of college or above
is compared to less than college. The zero line means
no effect. See full results in the SM.

A principal question we wanted to address 
was the extent to which ethnic and racial 
inequalities in science literacy are simply 
reflections of well-established disparities in 
more fundamental axes of disadvantage, in- 
cluding broader foundational literacies. They 
are not. When we compared whites with 
black and Hispanic respondents who hold 
similar attitudes toward science and have the 
same degree of confidence in its institutions, 
we still find persistent disparities in science 
literacy. We do not claim to have captured all 
of these disadvantages in our analysis, as our 
variables are measured with error and are 
relatively broad-brush, but we have at least 
included key dimensions. 

This analysis invites the question as to 
what could be responsible for the remaining 
gaps. It may be that the specific knowledge 
questions asked or the survey response context
favors whites, but, more important, we
suspect that our education measures mask 
considerable heterogeneity in the experi- 
ences of children, young people, and adults 
of different races and ethnicities. Graduat- 
ing from high school, earning a college de- 
gree, or taking a science class can consist in 
a wide variety of experiences, some of which 
are likely correlated with race and ethnic- 
ity. As recently as the early 1970s, black and 
Hispanic children were much more likely to 
attend schools funded at a lower than aver- 
age rate and intentionally segregated by eth- 
nicity, and such segregation has continued 
de facto to varying degrees (14). Microsocial
experiences of nondominant groups in any 
learning environment—for example, stereo- 
type threat and racial microaggressions—can 
shape learning experiences (15). Although we 
adjust for educational qualifications, we do 
not capture differences in the quality of edu- 
cation experienced by blacks and Hispanics. 

This suggests that educational interven- 
tions need to measure, and target, not just 
the quantity of instruction and formal quali- 
fications, as we do here, but also quality. We 
may also be able to craft training and pub- 
lic awareness campaigns to help scientists, 
teachers, and employers to be more sensitive 
to the subtle manifestations of bias. What- 
ever the remedy, ignoring science literacy 
disparities among underserved groups does 
not serve science or society well. 

HUMAN HEREDITY

Our inheritance

A fascinating history of heredity research reveals
the field’s highs and lows

By Axel Meyer

She Has Her Mother’s Laugh
The Powers, Perversions, and Potential of Heredity
Carl Zimmer
Dutton, 2018. 672 pp.

With hundreds of ancient human
genome sequences at our fingertips
and millions of contemporary
samples provided by customers
of consumer genetics companies,
now, more than ever before, we
are able to discover, decipher,
and interpret mixing, migration
events, and genetic variants in
human populations. Into this
zeitgeist enters Carl Zimmer’s
most enjoyable new book, She
Has Her Mother’s Laugh, with a
sweeping overview of the history
of our understanding of heredity.

From Mendel’s peas and Soviet-era
Lysenkoism to chimeras,
CRISPR, and the microbiome,
the book traces the scientific discovery
of genes, chromosomes,
and epigenetics, as well as recent
advances in gene therapy and paleogenomics.
It is ambitious in scope, but
Zimmer is one of the best science journalists
of our times, with a long history of setting
the bar for beautiful, clear, and scholarly
writing. He is true to form in this book.

Zimmer begins with a personal tale of a
visit to a genetic counselor that preceded
the birth of his first child. This experience
reveals how little he actually knew about heredity
at the time. Later, he seems to “hear
heredity” (his daughter’s laugh is so similar
to his wife’s), but why? And what makes his
second daughter so different from the first?

Interesting historical tidbits
are peppered throughout the
book. The term “hereditas,” Zimmer
reveals, for example, was employed
by the Romans and referred
not to biological inheritance but
to a legal concept concerning
who would inherit a relative’s
assets. Genealogies, we learn,
evolved in their depiction of biological
connections from vertical
lines to treelike structures,
which, to the French, resembled
the forked pé de grue (“crane’s
foot,” later rendered into “pedigree”
in English).

In the 14th through the 16th centuries, 
Europeans began to conceive of the link 
between generations as being related to the 
blood, although other cultures maintained 
very different ideas. Native populations on 
the Malaysian island of Langkawi, for exam- 
ple, continued to believe that kinship was 
achieved by sharing foods. 

Evolutionary biology is Zimmer’s specialty:
He coauthored one of the best textbooks in
the field, and it shows (1). But evolution and
heredity are not the same, both constituting
huge fields that only partially overlap.

Charles Darwin famously misunderstood 
inheritance, proposing that tiny particles 
shed from an organism’s many cells mi- 
grated to the gonads, where they were trans- 
ferred to offspring. His cousin Francis Galton 
did further disservice to the field of heredity, 
having conceived of the notion of “eugenics”: 
a program whereby strategically arranged 
marriages would result in “better” humans 
from generation to generation. Ironically, 
Nazi bombs destroyed the Galton labora- 
tory during World War II. However, the lab’s 
research continued, becoming part of Uni- 
versity College London’s Department of Ge- 
netics, Evolution, and Environment in 2013. 

“Individual Z,” as Zimmer refers to himself
after having his own genome sequenced,
consults experts in New Haven and New 
York to help him interpret the results. His 
Y-chromosome haplotypes, he learns, are 
shared with Napoleon’s (a lock of hair was 
saved after the general’s death, allowing 
French researchers to extract some genetic 
information in 2011). Luckily for Zimmer, 
his results were otherwise rather boring. 

The book reminds one how little informa- 
tion is typically transmitted between gen- 
erations in many families. Often, we don’t 
know where our ancestors were born, what 
diseases plagued them, or how long they 
lived. Genome sequencing stands to shed 
light on some of these genealogical ques- 
tions. Now we can say, for example, that we 
are 43% Ashkenazi Jewish or put a name 
to a mysterious malady that plagued our 
mother’s sister. 

When one writes about genetics and he- 
redity in humans, one runs the danger of ig- 
niting societal short fuses. Zimmer does not 
shy away from controversies, but he does, 
artfully, steer around some of the shallows.
He points out, for example, how historically 
transient, arbitrary, and scientifically wrong 
it is to classify humans based on skin colors. 
“[S]kin color,” he writes, “is not a timeless
hallmark of human races. It has changed in 
different places and at different times.” 

She Has Her Mother’s Laugh is more than 
a historical account of the field of genetics. 
It’s a treasure trove of curious facts, contex- 
tual tidbits, and up-to-date reports on the 
trials and tribulations of heredity told in a 
most entertaining way.

Truth with a vengeance

A filmmaker with an ax to grind takes aim at
Thomas Kuhn’s legacy

By Joseph D. Martin

Thomas Kuhn, an American physicist,
historian, and philosopher of science,
achieved prominence with the 1962
publication of his influential book,
The Structure of Scientific Revolutions.
Science, it contended, does not
generate incrementally truer descriptions
of reality but develops through
radical paradigm shifts. Kuhn’s
theory of scientific revolutions
motivated expansions and reactions
that reconfigured the history
and philosophy of science.

Errol Morris, an acclaimed
documentary filmmaker who
briefly studied under Kuhn at
Princeton, instead remembers the
acerbic philosopher for flinging
the titular ashtray at him during
an argument in 1972. Morris
hurls The Ashtray back, targeting
not Kuhn’s person but his legacy.
“Our posttruth moment is all Thomas
Kuhn’s fault” would be an unfair summation
of the book, but only just.

Structure, Morris writes, brutalized the
truth, and Kuhn was an avatar of postmodernism
who rejected the external world’s
very existence. Deep personal animosity (so
overt that he compares Kuhn to Hitler in his
acknowledgments) fuels Morris’s critiques.

A filmmaker’s sensibility informs the
book’s cadence; rather than demonstrating
Kuhn’s errors, Morris conjures an image of
his wrongness. He interweaves philosophical
explication, cultural allusion that careens
between the highbrow and the low,
interviews with authoritative figures, and
copious illustrations. Incommensurability
is his principal target.

Kuhn’s notion that adherents of one
paradigm inhabit a different mental world
from, and cannot communicate meaningfully
with, those of another is the most radical
element of his philosophy. Morris, who
favors Saul Kripke’s causal theory of reference,
considers incommensurability to be
epistemic anarchy.

For Kripke, theoretical commitments do
not bedevil our linguistic reference to things.
Morris adjudges Kripke’s philosophy a firm
foundation for scientific knowledge, setting
up a choice: “Does science progress toward
a more truthful apperception of the physical
world? Or is it all a matter of opinion, a
sociological phenomenon that reflects consensus,
not truth?” Hard-nosed scientific
certitude or frothy-lipped relativism?

This dyad, too neat to encapsulate
Kuhn’s legacy, recalls Morris
Zapp, the bumptious protagonist
of David Lodge’s Changing Places.
Zapp had a mission: a series of
Jane Austen commentaries so
panoptic “that when each commentary
was written there would
be simply nothing further to say
about the novel in question.”
The joke, of course, is the perfect
totality with which any such
endeavor escapes the point. We
don’t study literature in search 
of finality but to enable new 
ways of thinking. Daniel Dennett makes a 
similar suggestion about philosophy, asking 
whether it would be preferable to be so de- 
finitively correct that you foreclose an area 
of inquiry or to err so spectacularly that 
people debate your blunders for centuries. 
Scientists tend to choose the former, phi- 
losophers the latter. 

Kuhn had many fecund failures. Structure,
for all its flaws (many of Morris’s
specific critiques are apt), forced new
approaches to questions about how scientific
truth claims arise in practice. Morris might 
even have been inclined to agree, if not 
for the depth of his animus. “The role of a 
good documentary,” he once told the Boston
Globe, “is not to convince you about what 
happened, but to force you to think about 
what happened.” 

Theories of reason and reference describe 
the clean, ideal ways that our minds might 
latch onto the world. But the practice of sci- 
ence is far messier. Kuhn sensitized histori- 
ans and philosophers to that messiness. He 
wondered how philosophy would have to 
adapt if it countenanced the historical pro- 
cesses by which scientists actually achieved 
understanding. Close scrutiny reveals those 
to be profoundly dissimilar to the tidy phil- 
osophical frameworks Morris favors, which 
are little informed by, and have scarcely 
influenced, the practices that provide the 
sturdiest accounts of the natural world. 

Morris charges Kuhn with undermining 
the truth and authority of science. But that 
authority, the crux of cultural battles over 
climate change and evolution, has less to 
do with abstract, transcendent truth than 
with reliability, consensus, and the social 
processes that enable them. 

The ironclad truth Morris seeks, so inte- 
gral to the popular image of science, is one 
of science’s greatest vulnerabilities. A re- 
sponsible scientist will always admit doubt. 
Combined with the widespread perception 
that science seeks irreproachable truth, that 
is a powerful weapon for those who would 
undermine scientific authority. 

Thomas Kuhn was unkind to Errol Morris. 
By all accounts, he was often unkind. That 
kindness and influence so frequently fail to 
intersect is a bitter injustice. But so, too, is 
vilification in the service of vendetta. 

Inventing Ourselves

The Secret Life of the Teenage Brain 
Sarah-Jayne Blakemore 
PublicAffairs, 2018, 256 pp. 


Elusive, evasive, and uncommunica- 
tive, the human adolescent is 
among the most enigmatic subjects 
ever to be studied. This week on
the Science podcast, Sarah-Jayne
Blakemore delves into the teenage 
brain, revealing the extraordinary 
features that define this transitional 
state in human development. 
www.sciencemag.org/podcasts

Brazil’s government
attacks biodiversity 


To address the rapid and massive loss of 
biodiversity worldwide, scientific research 
must inform agile decision-making. The 
political leaders of Brazil, the country with 
the planet’s greatest biological wealth (1), 
continue to undermine this goal. In 2001, 
citing the laudable objective of prevent- 
ing biopiracy, the government created the 
Genetic Heritage Management Council 
(CGen) (2), making established biodiver- 
sity research activities illegal overnight by 
imposing severe limitations on access and 
shipment of specimens and samples. After 
5 years of pressure from the academic com- 
munity, the government finally resolved 
to exempt several areas of basic research 
from the restrictive policy (3). Now, after 
severe budget cuts (4, 5) and legal maneu- 
vers directly affecting biodiversity (6), the 
Brazilian Federal Government has reversed 
its position once again with the so-called 
New Law on Biodiversity (7, 8), striking a 
potentially fatal blow to biodiversity. 
Disguised as a milestone of progress for 
scientific research and endorsed by some 
scientific sectors (9), this law constitutes 
a monumental setback. According to the 
law, which is retroactive to 2000 or 2015, 
depending on the area of research (8), 
and technically applies to researchers at 
both Brazilian and foreign institutions 
(10), virtually every research activity on 
Brazilian biodiversity—even nongenetic 
studies in taxonomy, ecology, physiol- 
ogy, and behavior—must be registered in 
the National System of Genetic Resource
Management and Associated Traditional
Knowledge (SisGen), which was created to 
assist the CGen. The deadline for meeting 
these draconian rules is 5 November 2018 
(9, 11). Failure to comply, including the 
unregistered dissemination of scientific 
results, even if based only on public 
databases like GenBank or previously 
published data or results, will result in 
heavy fines (7). A lecture given by the co- 
ordinator of the Academic Sectoral Board 
of the CGen, titled “How not to be fined,” 
highlights the punitive spirit of the law 
(10). Paradoxically, commercial activities 
involving Brazilian biota, such as export 
of ornamental fishes, plants, grains, and 
other marketable products, remain unaf- 
fected by the law (7). 






The pumpkin toadlet is just one example of the 
diverse fauna endemic to Brazil. 


If not repealed or substantially over- 
hauled, this Byzantine labyrinth of unnec- 
essary demands and threats will decimate 
scientific research on Brazilian biodi- 
versity by requiring scientists to divert 
an inordinate amount of already limited 
resources from research to the time-con- 
suming process of registering every speci- 
men, DNA sequence, photograph, and any 
other observation of Brazilian biodiversity 
before publication, presentation at scien- 
tific meetings, or dissemination to media 
outlets. Scientists must take back the 
reins of their own activities by demanding 
that the Brazilian government imple- 
ment laws that facilitate international 
collaboration and encourage biodiversity 
research instead of stifling it. Otherwise, 
a substantial part of the world’s biodiver- 
sity and its benefits may silently vanish 
behind a wall of bureaucracy.

The road to wild yak
protection in China 


China is home to about 22,000 wild yaks, 
which account for 90% of the global wild 
yak population (1). The International Union
for Conservation of Nature (IUCN) catego- 
rizes the wild yak, a cold-tolerant herbivore, 
as a vulnerable species (2), mainly attrib- 
utable to excessive hunting for food and 
trade. China’s wild yak is also threatened
by land-use change, disease, environmental
pollution, genetic contamination, climate
change, and resource competition (1, 3, 4). In
recent decades, infrastructure construction 
in China has grown increasingly disruptive 
to remaining wild yak populations. 

Most wild yaks live in or near the Tibetan 
Plateau (1). These regions are located in
or adjacent to areas zoned for the Western 
Development Strategy (5), an ambitious plan 
proposed in 1999 to increase the economic 
level and quality of life of China’s rural 
citizens. The plan’s implementation has 
accelerated the construction of railways and 
roads. Railways in the central western region 
of China, which account for 76.6% of China’s 
current rail traffic (6), expanded from 70,000 
km in 2014 (7) to 95,000 km in 2016 (6). 
Meanwhile, highway density increased from 
7.7 km/100 km² in 1999 to 20.6 km/100 km²
in 2008 (8). The increasing density of rail- 
ways and roads has fragmented the habitats 
of wild yaks and forced them to migrate to 
resource-limited areas to escape from preda- 
tion and conflict with humans. 

China’s wild yaks are an important genetic 
resource for breeding new yak species in an 
effort to sustainably develop animal hus- 
bandry in the Tibetan area and enrich the 
region’s biodiversity (7, 9). Wild yaks are also 
an important component of biodiversity in 
nature (9). To protect China’s wild yaks, the 
Chinese government has built several nature 
reserves (10, 11), but even there, the yaks are
threatened by illegal hunting (10). To ensure
the safety of China’s wild yaks, China must 
further expand the nature reserves and 
effectively enforce existing hunting bans. 
The government should also implement 
scientific management and protection poli- 
cies that minimize habitat fragmentation, 
resource plunder, and predation.

Trout in hot water:
A call for global action 


Trout are one of the most culturally, econom- 
ically, and ecologically important taxonomic 
groups of freshwater fishes worldwide (1).
Native to all continents in the Northern 
Hemisphere, trout belong to seven genera, 
which are distributed across 52 countries. 
These cold-water specialists provide recre- 
ation and food to millions of people and play 
important roles in ecosystem functioning 
and health (2). They are also extremely sen- 
sitive to human disturbances because they 
require cold, clean, complex, and connected 
habitats for survival and persistence (3)—all 
attributes that humans have substantially 
altered and degraded (4, 5). Despite their 
importance as societal icons and as indica- 
tors of biodiversity, many of the world’s trout 
species and lineages are endangered and 
some require immediate conservation efforts 
to reverse their precarious decline. 

Of the 124 recognized species of trout 
(6, 7), only 67 (54%) have been assessed by 
the International Union for Conservation
of Nature (IUCN) (7). Alarmingly, 73% of
these species are currently threatened 
with global extinction, and four are now 
extinct (7). Although some of these spe- 
cies are likely subspecies, lineages, or 
distinct ecotypes, this level of threat is 
exceptionally high compared with other 
vertebrate groups assessed by the IUCN 
(8). Widespread imperilment of trout 
reflects numerous human activities identi- 
fied in the IUCN assessments, including 
invasive species, overfishing, pollution, 
dams, deforestation, agriculture, grazing, 
and mining (7). Climate change is further 
stressing trout populations by warming 
water temperatures, shifting streamflow 
regimes, increasing extreme events (such 
as floods, drought, and wildfire), and 
facilitating species invasions (9, 10). Worse, 
climate change often acts in synergy with 
other stressors to further endanger trout 
(11, 12), a pattern that will be intensified 
in coming decades as global temperatures 
continue to rise, with important con- 
sequences for trout, aquatic ecosystem 
functioning, and human well-being. 
Reversing these declines will require 
progressive conservation efforts to protect 
native trout diversity and ameliorate ongo- 
ing and future threats at local and global 
scales. To preserve these unique fishes, we 
must protect ecological and genetic diversity, 
which are critical for long-term resiliency, 
viability, and adaptation in the face of rapid 
environmental change. Innovative conser- 
vation approaches include reconnecting 
rivers with floodplains, establishing native 
fish refuges, restoring habitat diversity, and 
reducing invasive species, including non- 
native trout stocking programs. Moreover, 
comprehensive, coordinated, and compa- 
rable approaches are needed immediately to 
assess conservation status and to delin- 
eate conservation units across the globe, 
particularly for data-poor species. Only by
addressing threats at their root causes can
we accomplish these conservation goals.

Many trout species, such as these native bull trout, are endangered.

Comment on “Plant diversity increases
with the strength of negative density 
dependence at the global scale” 


Lisa Hülsmann and Florian Hartig
LaManna et al. (Reports, 30 June 2017, p.
1389) claim that subadult trees are propor- 
tionally less common at high conspecific adult 
density (CNDD), and that this effect increases 
toward the tropics and for rare species. We 
show that the CNDD-abundance correlation 
may have arisen from a methodological arti- 
fact and that a range of processes can explain 
the reported latitudinal patterns. 

Full text: dx.doi.org/10.1126/science.aar2435 


Response to Comment on “Plant diversity 
increases with the strength of negative den- 
sity dependence at the global scale” 


Joseph A. LaManna, Scott A. Mangan, 
Alfonso Alonso, Norman A. Bourg, Warren 
Y. Brockelman, Sarayudh Bunyavejchewin, 
Li-Wan Chang, Jyh-Min Chiang, George B. 
Chuyong, Keith Clay, Susan Cordell, Stuart 
J. Davies, Tucker J. Furniss, Christian P. 
Giardina, I. A. U. Nimal Gunatilleke, C. V. 
Savitri Gunatilleke, Fangliang He, Robert 
W. Howe, Stephen P. Hubbell, Chang-Fu
Hsieh, Faith M. Inman-Narahari, David
Janik, Daniel J. Johnson, David Kenfack, 
Lisa Korte, Kamil Kral, Andrew J. Larson, 
James A. Lutz, Sean M. McMahon, 
William J. McShea, Hervé R. Memiaghe, 
Anuttara Nathalang, Vojtech Novotny, 
Perry S. Ong, David A. Orwig, Rebecca 
Ostertag, Geoffrey G. Parker, Richard
P. Phillips, Lawren Sack, I-Fang Sun,
J. Sebastian Tello, Duncan W. Thomas,
Benjamin L. Turner, Dilys M. Vela Diaz, 
Tomas Vrska, George D. Weiblen, Amy 
Wolf, Sandra Yap, Jonathan A. Myers 
Hülsmann and Hartig suggest that eco-
logical mechanisms other than specialized
natural enemies or intraspecific competition 
contribute to our estimates of conspecific 
negative density dependence (CNDD). To 
address their concern, we show that our 
results are not the result of a methodological 
artifact and present a null-model analysis 
that demonstrates our original findings—(i) 
stronger CNDD at tropical relative to temper- 
ate latitudes, and (ii) a latitudinal shift in the 
relationship between CNDD and species 
abundance—persist even after controlling for 
other processes that might influence spatial 
relationships between adults and recruits. 
Full text: dx.doi.org/10.1126/science.aar3824 


Comment on “Plant diversity increases
with the strength of negative density
dependence at the global scale”

Ryan A. Chisholm and Tak Fung 

LaManna et al. (Reports, 30 June 2017, p.
1389) found higher conspecific negative 
density dependence in tree communities at 
lower latitudes, yielding a possible mecha- 
nistic explanation for the latitudinal diversity 
gradient. We show that their results are arti-
facts of a selective data transformation and
a forced zero intercept in their fitted model. A
corrected analysis shows no latitudinal trend. 
Full text: dx.doi.org/10.1126/science.aar4685 


Response to Comment on “Plant diversity 
increases with the strength of negative 
density dependence at the global scale” 


Joseph A. LaManna et al.

Chisholm and Fung claim that our
method of estimating conspecific negative
density dependence (CNDD) in recruitment 
is systematically biased, and present an 
alternative method that shows no latitudinal 
pattern in CNDD. We demonstrate that their 
approach produces strongly biased estimates 
of CNDD, explaining why they do not detect
a latitudinal pattern. We also address their
methodological concerns using an alternative 
distance-weighted approach, which supports 
our original findings of a latitudinal gradient in 
CNDD and a latitudinal shift in the relationship 
between CNDD and species abundance. 

Full text: dx.doi.org/10.1126/science.aar5245

Comment on “Plant diversity increases
with the strength of negative density 
dependence at the global scale” 


Lisa Hülsmann* and Florian Hartig


LaManna et al. (Reports, 30 June 2017, p. 1389) claim that subadult trees are proportionally 
less common at high conspecific adult density (CNDD) and that this effect increases toward the 
tropics and for rare species. We show that the CNDD-abundance correlation may have
arisen from a methodological artifact and that a range of processes can explain the reported
latitudinal pattern.


Conspecific negative density dependence
(CNDD) has long been discussed as a key 
mechanism for maintaining local species 
richness and global biodiversity patterns. 
LaManna et al. (1) analyzed CNDD in 24 
plots of the global CTFS-ForestGEO network, de- 
fining density dependence as an effect of local 
adult density on the recruit/adult ratio in 10 m ×
10 m and 20 m × 20 m quadrats. Fitting this ratio
against conspecific adult density with Ricker and
offset-power models, they found that CNDD in-
creases toward the tropics and with species
rarity.

After carefully examining their statistical anal- 
ysis, we believe that these patterns, intriguing as 
they are, may have arisen from a methodological 
artifact. We simulated synthetic data under a 
range of processes with and without CNDD (2) 
and found that both Ricker and offset-power 
models produce biased CNDD estimates as soon 
as a process breaks the spatial coupling between 
adults and recruits. For example, dispersal, adult
mortality, or niche processes (3-5) can produce
CNDD and CNDD-abundance correlations com- 
patible with values reported in LaManna et al., 
despite no CNDD being present whatsoever (Fig. 1). 
Moreover, the strength of the bias depends on 
community characteristics such as species rich- 
ness or adult proportion, which suggests an al- 
ternative explanation for the reported latitudinal 
pattern. 

One can understand the reason for this bias. If 
recruitment is not entirely local, the assumption 
that local counts of adults (A) and recruits (S) 
should be directly proportional does not hold, 
even without any CNDD (6). A special example of 
this is the frequent quadrats with recruits but 
no adults in the data. To explain these observa- 
tions, the authors introduced a background adult 
density of A = 0.1 in the Ricker model. This 
decision, however, creates S/A ratios of ≥10 for
the corrected observations with A = 0.1 and S > 0,
much higher than the mean ratio of approxi- 
mately 4 in the data where A > 0. To explain 
these seemingly high reproduction ratios in quad- 
rats with only background adult density, the 
Ricker model must wrongly assume high intrinsic 
growth paired with strong density dependence 
(Fig. 2C). And because S > 0 and A = 0 occur more
frequently for rare species and in the tropics,
CNDD values and CNDD-abundance correla-
tions emerge that are compatible with the patterns
reported in LaManna et al., even when no CNDD
is present (Fig. 2A).

Fig. 1. Processes and community characteristics that create artifacts in
estimated CNDD patterns. (A and B) Weighted mean CNDD (A) and CNDD-
abundance correlations (B) estimated by the Ricker model with simulated
data, varying ecological parameters (CNDD, dispersal, adult survival, habitat
specificity), and community characteristics (species richness, proportion of
adults) at the 10 m × 10 m scale. CNDD is zero except for the CNDD subplots
where the same value was used for all species. See (2) for details of the simulation
settings. Line colors correspond to parameter values in the upper subpanels;
black lines depict CNDD-abundance correlations for the tropical Barro Colorado
Island (BCI) (16) forest plot. Parameter estimation follows LaManna et al.
Axis label: Species abundance, log10(N/ha).

Hülsmann et al., Science 360, eaar2435 (2018), 25 May 2018

In the offset-power model, zeros were dealt 
with differently by adding a value of 1 to all ob- 
servations. Again, this distorts recruit/adult ratios, 
as S/A ≠ (S + 1)/(A + 1), and more so for rare
species with smaller values of S and A. When 
LaManna et al. then fitted a mixed-effects model 
with a random species effect on the CNDD slope 
but no random intercept (7), the rarity-dependent 
S/A distortion is compensated by species-specific 
CNDD (Fig. 2B). When we fitted the more natural 
standard random slope and intercept model that 
accounts for species-specific S/A ratios, the CNDD- 
abundance correlation vanished in simulated 
and real data (Fig. 2D). 
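
The effect of the two zero-handling conventions described above can be checked with simple arithmetic. The following is a minimal sketch, not the code used by either group of authors; the function names and the example counts are hypothetical and serve only to show how each convention shifts the recruit/adult ratio.

```python
# Illustrative only: how two ways of handling zeros change the
# recruit/adult ratio S/A. All counts below are made-up examples.

def ratio_with_background(S, A, background=0.1):
    """S/A where quadrats without adults receive a background adult density."""
    return S / (A if A > 0 else background)

def ratio_plus_one(S, A):
    """(S + 1)/(A + 1): the ratio after adding 1 to all observations."""
    return (S + 1) / (A + 1)

# One recruit in a quadrat with no adults is divided by the background
# density of 0.1 instead of by an observed adult count.
no_adult_ratio = ratio_with_background(1, 0)

# Two hypothetical species with the same underlying ratio S/A = 4,
# one rare (small counts) and one common (large counts).
rare = ratio_plus_one(4, 1)
common = ratio_plus_one(400, 100)

print("background convention, S=1, A=0:", no_adult_ratio)
print("plus-one convention, rare species:", rare)
print("plus-one convention, common species:", round(common, 3))
```

In this toy case the plus-one convention moves the rare species much further from the underlying ratio of 4 than the common one, which is the rarity-dependent distortion the paragraph describes.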

LaManna et al. realized that CNDD estimates 
could be biased and therefore ran a number of 
simulations and null models to explore and po- 
tentially correct this bias. However, our results 
(Figs. 1 and 2) clearly contradict their assertion 
that their “results are generally robust to these 
potential biases.” Examining their null models, 
we find several shortcomings: (i) Their analyses 
and corrections often only pertained to mean 
values and not to CNDD-abundance correlations, 
and where they did, simulations were not run 
with those critically low abundances that distort 
CNDD estimates; (ii) their null simulations in-
cluded only dispersal, but none of the other fac-
tors that we show to create bias (Fig. 1); and (iii)
their assumption that only 10% of the recruits
derive from outside the quadrats produces un- 
realistically low dispersal relative to empirical 
values for primary dispersal (8). Even a uniform 
kernel with a 10-m radius would predict 52% of all 
seeds to disperse outside a 10 m × 10 m quadrat.
With our null simulations, we tend to find clear 
CNDD signals and CNDD-abundance correlations 
similar to what LaManna et al. reported under 
perfectly neutral (CNDD = 0) conditions. Accord- 
ingly, we find their low CNDD estimates for 
rare temperate species far more surprising than 
strong CNDD and CNDD-abundance slopes in 
the tropics. 
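
The argument about dispersal out of a quadrat can be explored with a small Monte Carlo sketch. The text does not spell out how the uniform kernel is defined, so the assumptions here are loud ones: parents are placed uniformly at random inside a 10 m × 10 m quadrat, and "uniform" is tried in two readings (uniform in dispersal distance, or uniform over the area of the disc). The function name and parameters are hypothetical, and the sketch does not attempt to reproduce the 52% figure exactly.

```python
import math
import random

def fraction_outside(n, side=10.0, radius=10.0, uniform_in="distance", seed=0):
    """Estimate the share of seeds that land outside the parent's quadrat.

    Assumptions (not taken from the source): the parent position is uniform
    within a side x side quadrat, the direction is uniform, and the distance
    is either uniform on [0, radius] ("distance") or drawn so that seeds are
    uniform over the disc of that radius ("area").
    """
    rng = random.Random(seed)
    outside = 0
    for _ in range(n):
        x0 = rng.uniform(0.0, side)
        y0 = rng.uniform(0.0, side)
        u = rng.random()
        r = radius * u if uniform_in == "distance" else radius * math.sqrt(u)
        theta = rng.uniform(0.0, 2.0 * math.pi)
        x = x0 + r * math.cos(theta)
        y = y0 + r * math.sin(theta)
        if not (0.0 <= x <= side and 0.0 <= y <= side):
            outside += 1
    return outside / n

for mode in ("distance", "area"):
    print(mode, fraction_outside(100_000, uniform_in=mode))
```

Under either reading, more than half of the simulated seeds leave the quadrat, which is far above a 10% share of recruits arriving from outside.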

A reliable correction of CNDD estimates, even 
with more realistic null models, seems very dif- 
ficult. One would require species-specific values 
for dispersal and all other spatial processes that 
we have shown to influence CNDD estimates (9). 
Species-specific values are important because a 
correlation of traits such as dispersal or habitat 
specificity with species abundance or latitude 
could create additional artifacts (10), let alone 
the effect of intraspecific trait variation. Based 
on this, we believe it is virtually impossible to 
devise appropriate corrections or null models 
for the analysis presented by LaManna et al. 
Directly analyzing CNDD in demographic rates 
such as growth and mortality of saplings (11),
which are much less affected by dispersal, seems 
far more promising to us. 

The biased estimator questions the evidence
for the reported CNDD patterns, but does not
constitute proof of their absence. Yet, when
combining our analysis with general ecological 
knowledge, we find it rather unlikely that the 
patterns reported in LaManna et al. are primarily 
caused by CNDD. A previous study found no 
support for a latitudinal CNDD pattern in tree 
mortality (12), and although some studies have
reported CNDD-abundance correlations in the 
tropics (11, 13), effect sizes tended to be much 
smaller than what LaManna et al. reported, par- 
ticularly as they inappropriately contrasted CNDD 
and heterospecific density effects. Strong CNDD- 
abundance correlations also seem unlikely from 
theoretical considerations. It is frequently as- 
sumed that CNDD is mediated by specialized 
pathogens (14). It would be surprising if such 
strict mutualisms could persist and exert effec- 
tive control for exceedingly rare species. More- 
over, if CNDD applied only to rare species, these 
would be stabilized against each other, but the 
question remains how infrequent species could 
escape competition from common ones that do 
not suffer from strong CNDD. 

If not real CNDD, what then causes the lati- 
tudinal patterns in the (biased) CNDD estimates? 
Diversity differences alone could create a latitu- 
dinal pattern in mean CNDD but would not lead 
to a latitudinal shift of the CNDD-abundance 
correlation (Fig. 1). However, either latitudinal 
differences in spatial processes [e.g., stronger 
habitat effects or reduced dispersal in temperate 
regions (9)] or latitudinal differences in the size 
structure could create the pattern. Size structure 
is a highly sensitive parameter for the bias be- 
cause LaManna et al. distinguished adults and 
recruits according to their diameter (Fig. 1, adult 
proportion). Because the original data were not 
fully available to us, we could not examine these 
possibilities in more detail. 

We believe that LaManna et al.’s results should 
be interpreted far more carefully. Given the in- 
stability of the statistical methods, we do not see 
strong evidence for any of the reported CNDD 
patterns. To us, the findings mainly suggest that 
the spatial association between adults and recruits 
in the tropics is surprisingly weak, as already 
reported by (15), and that either this spatial
association becomes stronger toward the tem- 
perate zone, or another parameter (possibly 
size structure) creates the latitudinal pattern 
in the biased estimator. A latitudinal gradient 
of abundance-dependent CNDD would also be 
compatible with our findings, but overall, we 
find it far more likely that a range of ecological 
processes—possibly in combination with CNDD, 
but not primarily driven by it—are responsible 
for the reported patterns.

Response to Comment on “Plant
diversity increases with the strength
of negative density dependence at the
global scale”

Hülsmann and Hartig suggest that ecological mechanisms other than specialized natural
enemies or intraspecific competition contribute to our estimates of conspecific negative 
density dependence (CNDD). To address their concern, we show that our results are not the 
result of a methodological artifact and present a null-model analysis that demonstrates that our 
original findings—(i) stronger CNDD at tropical relative to temperate latitudes and (ii) a 
latitudinal shift in the relationship between CNDD and species abundance—persist even after 
controlling for other processes that might influence spatial relationships between adults
and recruits.


To explore potential bias in our estimates
of conspecific negative density dependence
(CNDD) (1), Hülsmann and Hartig (2)
present simulations that vary several pro-
cesses, including dispersal, habitat specific-
ity, adult/recruit ratios, and species richness.
Some of these simulations produce spuriously
strong CNDD for rare species, leading them to 
suggest that our methods might be biased. If this 
were correct, then our estimates of CNDD would 
be biased toward stronger effects for rare species 
at any latitude. However, this was not the case,
because our original estimates of CNDD varied
substantially among rare species when matched
for abundance [figure 2D in (1)]. Furthermore,
median CNDD for rare species differed across 
latitudes, with rare species having stronger 
median CNDD in tropical than in temperate 
forests [figure 2C in (1)].

The strong negative biases in CNDD that 
Hülsmann and Hartig observe for rare species are
erroneously generated because their simulation 
models assume that a certain proportion of re- 
cruits are globally dispersed across an entire 50-ha 
forest plot on Barro Colorado Island (BCI). This 
assumption is biologically unrealistic because the 
vast majority of species at BCI exhibit dispersal 
limitation (3), spatially clumped distributions 
(4, 5), and an average mean dispersal distance of 
28 m (6). Although such distances allow some 
recruits to disperse quite far from parent trees, 
recruit density is still greatest around the parent 
(7, 8). That is why analyses that assume global 
dispersal, as in Hülsmann and Hartig, under-
estimate or fail to detect CNDD when it is ac-
tually present (8) (Fig. 1). Moreover, the assumption
of global dispersal results in recruits of rare species 
being farther from adults than recruits of common 
species (r = −0.43, P < 0.0001, N = 187 species in the
BCI data set), mimicking the same pattern that 
would be produced if CNDD were stronger for 
rare species. Even where Hülsmann and Hartig
relax the assumption of completely global disper- 
sal, a large proportion of recruits are still randomly 
dispersed across the entire plot (e.g., 50% directly 
under parent tree, 50% globally dispersed; Fig. 1). 
Moreover, these empirically unjustifiable assump- 
tions about dispersal permeate their simulations 
of other processes (i.e., the authors use some 
degree of global dispersal for all other panels in 
their figure 1). Thus, the unrealistic assumption 
of global dispersal largely generates the biases 
that Hülsmann and Hartig claim to observe.

Hülsmann and Hartig suggest that our use of
an offset value introduced bias in our CNDD 
estimates. We applied an offset value to retain 
recruits located in 20 m × 20 m quadrats without
a conspecific adult in the calculation of CNDD (1).
Retention of all recruits was important because, 
in many cases, recruits in quadrats without adults
likely came from adults in adjacent quadrats.
The mean distance (±SE) between recruits in
these quadrats and the nearest conspecific adult
was 26.95 ± 0.04 m (24.16 ± 0.19 m and 27.08 ±
0.04 m for temperate and tropical trees, re- 
spectively), and 80% were within 36 m of a con- 
specific adult (31 and 36 m for temperate and 
tropical trees, respectively). These distances are 
well within average dispersal kernels for tree 
species (6, 9). Therefore, we applied an offset 
value to quadrats with recruits but no conspecific 
adults, so as to ensure that these recruits re- 
mained in the calculation of CNDD and to avoid 
bias that results from excluding these recruits 
(1, 10, 11). However, adding the offset value to all 
quadrats did not qualitatively change either the 
relationship between species rarefied richness 
and CNDD across latitudes (r = −0.877, P < 0.001)
or the latitudinal shift in the relationship be-
tween CNDD and species abundance (r = −0.552,
P = 0.006). Moreover, these findings persisted 
when we used an alternative distance-weighted 
approach to estimate CNDD that avoids the use 
of an offset altogether (11). Therefore, the main
findings of our original paper are robust to the 
statistical approach used to estimate CNDD. 

To further verify that our results reflect changes 
in CNDD across species and latitudes, we used a 
null model similar to the model used by Hülsmann
and Hartig that simply modifies the assumption
of global dispersal. The model, recommended by 
Wiegand and Moloney (12), fixes adult locations 
and disperses recruits away from adults given 
some dispersal kernel. Instead of globally dis- 
persing all or a fraction of recruits (as done by 
Hülsmann and Hartig), we use the best-fitting
model for seed dispersal across tropical and
temperate forests (Clark’s 2dT dispersal kernel)
(6, 7, 9, 12). As in Hülsmann and Hartig’s model,
our null model also preserves habitat specificity 
(adult locations), adult/recruit ratios, and abun- 
dances for each species to test whether these 
processes could have generated artificial CNDD 
patterns. As suggested by Wiegand and Moloney, 
we also modeled immigration by treating each 
forest plot as a torus, allowing recruits dispersing 
off the plot to immigrate back into the plot from 
the other side (12). We performed two versions
of this null model: (i) with every species having 
a fixed mean dispersal distance of 30 m [ap- 
proximating the mean across >60 tropical and 
temperate species (6, 7)] and (ii) with species 
having allometrically scaled dispersal distances 
that incorporate intraspecific variation. For the 
allometric-dispersal model, we modeled inter- 
and intraspecific variation in dispersal using 
a recent meta-analysis, which showed that more 
than half of the total variation in mean dis-
persal distance across >200 plant species scaled
allometrically with their maximum height (9). We 
used this relationship, along with observed varia- 
tion around this relationship, to simulate a 
wide range of empirical dispersal distances for 
each species given its maximum height and to 
calculate null-expected values of CNDD for each 
species in our data set (12). 
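
The core mechanics of such a null model (fixed adult locations, recruits displaced by a kernel, and a torus boundary) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the kernel is written in a commonly cited 2Dt form with cumulative distribution F(r) = 1 − (1 + r²/u)^(−p), the parameter values u and p are arbitrary rather than fitted, the 1000 m × 500 m dimensions are simply one rectangle of 50 ha, and all function names are hypothetical.

```python
import math
import random

def sample_2dt_distance(u, p, rng):
    """Draw a dispersal distance by inverting F(r) = 1 - (1 + r^2/u)^(-p).

    u (scale, in m^2) and p (shape) are illustrative, not fitted, values.
    """
    q = rng.random()  # q lies in [0, 1), so 1 - q is never zero
    return math.sqrt(u * ((1.0 - q) ** (-1.0 / p) - 1.0))

def disperse_on_torus(adults, n_recruits, width, height, u, p, seed=0):
    """Place recruits around fixed adult locations, wrapping at plot edges.

    Each recruit picks a random parent, moves a kernel-drawn distance in a
    uniformly random direction, and re-enters from the opposite side if it
    leaves the plot (torus boundary).
    """
    rng = random.Random(seed)
    recruits = []
    for _ in range(n_recruits):
        ax, ay = rng.choice(adults)
        r = sample_2dt_distance(u, p, rng)
        theta = rng.uniform(0.0, 2.0 * math.pi)
        x = (ax + r * math.cos(theta)) % width
        y = (ay + r * math.sin(theta)) % height
        recruits.append((x, y))
    return recruits

# Hypothetical example: three adults near a plot corner, 1000 recruits.
adults = [(5.0, 5.0), (20.0, 8.0), (12.0, 30.0)]
recruits = disperse_on_torus(adults, 1000, 1000.0, 500.0, u=400.0, p=1.5)
print(len(recruits), "recruits placed; first:", recruits[0])
```

Because adult positions are never moved, any habitat association of adults is preserved, while the spatial relationship between adults and recruits is set only by the kernel.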

The results of this null-model analysis using the 
distance-weighted measure of adult abundance 
[described briefly above and presented in detail 
in (11)] are shown in Figs. 2 and 3. The expected 
values of CNDD were near zero and did not show 
evidence of strong bias across rare and common 
species. Results from these null-model analyses 
supported the conclusions from our original paper 
(1). Moreover, similar results were obtained when 
we applied the same null-model analysis using 
the Ricker model with offset approach presented 
in (1). Therefore, although we acknowledge that
other processes such as dispersal and habitat spec- 
ificity may contribute to our estimates of CNDD, 
null-model results strongly suggest that these pro- 
cesses alone cannot account for observed patterns 
in CNDD across latitudes and species. Instead, our 
results are consistent with the idea that density 
dependence, caused by specialized natural enemies 
and/or intraspecific competition, strongly con- 
tributes to the latitudinal diversity gradient. 


Fig. 1. Simulated spatial patterns of dispersal. 
(A to C) Simulated spatial patterns of 1000 
recruits (blue points) relative to a parent tree (red 
point) in a 50-ha plot the same size as the plot at 
BCI. (D to F) Recruits per m² as a function of
distance from the parent. Dispersal patterns 
were produced by Clark's 2dT dispersal kernel
(7) at the average observed mean dispersal
distance across more than 60 tropical and 
temperate species (6, 7) [(A) and (D)], by 
stratified dispersal (used in Hülsmann and
Hartig) whereby a certain proportion of recruits 
are dispersed into the same quadrat as the adult 
and the rest are globally dispersed across the 
entire forest plot [(B) and (E)], and by complete 
global dispersal [(C) and (F)]. Blue lines in (D) to 
(F) show the expected recruit density for each 
dispersal kernel [(D), observed mean dispersal 
from the literature; (E), stratified; (F), global]. The 
pink shaded areas show the expected recruit 
density using the observed mean dispersal from 
the literature [same as in (D)] (6, 7). The 
assumption of global dispersal is inappropriate 
because it vastly underestimates expected recruit 
density near the parent tree [compare blue lines 
with pink shaded areas in (E) and (F)] and 
produces a pattern identical to that expected 
under strong CNDD (8). Even stratified dispersal 
(E) underestimates the density of recruits between 
10 and 30 m. Recruits dispersing off the plot 
were retained by treating the plot as a torus (12). 
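
The torus treatment mentioned in this caption can be illustrated with a minimal sketch. It assumes a rectangular plot with its origin at one corner and, for illustration only, a 1000 m by 500 m footprint (one possible shape for a 50-ha plot). The function name `wrap_torus` and the coordinates are hypothetical and are not taken from the authors' code.

```python
def wrap_torus(x, y, width=1000.0, height=500.0):
    """Map a recruit position back onto the plot by wrapping each axis."""
    # Python's % returns a non-negative result for a positive modulus,
    # so a recruit leaving across one edge re-enters from the opposite edge.
    return x % width, y % height


# A recruit dispersed 10 m past the far x edge and 5 m below the y origin.
print(wrap_torus(1010.0, -5.0))
# A recruit that stays inside the plot is left where it is.
print(wrap_torus(250.0, 100.0))
```

Wrapping keeps the number of recruits fixed at the simulated total, so recruit density is not lowered near the plot edges simply because some recruits landed outside.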

Fig. 2. Results from a null model that incorporates
empirically supported values of dispersal and 
preserves habitat specificity, adult/recruit ratios, 
and abundances for each species. The null model, 
recommended by Wiegand and Moloney (12),
fixes adult locations to preserve habitat specificity of a
species and then disperses recruits away from adults. 
This is the same model used by Hülsmann and
Hartig, except that it incorporates empirically 
supported dispersal values. In this version of the null 
model, all species were given the same mean dispersal 
distance of 30 m, based on empirical estimates for 
more than 60 tropical and temperate species (6, 7). 
(A to F) A distance-weighted measure of adult 
abundance was used to calculate both observed [(A) 
to (C)] and simulated values [(D) to (F)] of CNDD (11). 
Observed and simulated values are presented on the 
same scale for comparison. The analysis was performed 
at the 20 m x 20 m scale. (G to I) Standardized
effect sizes are the observed value minus the mean
simulated value from 100 iterations of the null model 
divided by the standard deviation of the simulated 
values. These results support the main results in our 
original paper [figures 1 and 2 in (1)]. Colors reflect 
distance from the equator [see legend of figures 1 and 
2 in (1)]. Linear fits are shown, along with Spearman 
rank correlation coefficients and their P values. Gray 
dashed lines represent a median CNDD of zero [(A), (C), 
(D), (F), (G), and (I)] or a zero slope between CNDD 
and species abundance [(B), (E), and (H)]. 
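
The standardized effect size defined in this caption (observed value minus the mean simulated value, divided by the standard deviation of the simulated values) can be written as a short sketch. The function name is hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, because the caption does not say which estimator was used.

```python
import statistics


def standardized_effect_size(observed, simulated):
    """Return (observed - mean(simulated)) / sd(simulated)."""
    mean_sim = statistics.mean(simulated)
    # Sample standard deviation (n - 1); this choice is an assumption.
    sd_sim = statistics.stdev(simulated)
    if sd_sim == 0:
        raise ValueError("simulated values have zero variance")
    return (observed - mean_sim) / sd_sim


# Toy stand-in for the 100 null-model iterations described in the caption.
null_values = [0.0, 1.0, 2.0]
print(standardized_effect_size(2.0, null_values))
```

A value far from zero indicates that the observed statistic lies well outside the spread produced by the null model.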


Fig. 3. Results from a null model that incorpo- 
rates empirically supported interspecific differ- 
ences in dispersal and preserves habitat 
specificity, adult/recruit ratios, and abundances 
for each species. The null model is the same as the 
model shown in Fig. 2, except that all species are 
given a mean dispersal distance that scales allo- 
metrically with their maximum height and incorpo- 
rates variation in dispersal within species (9). 

The null-model results we report here support
the main conclusions of our original paper: 
CNDD was (i) stronger in tropical than in temper- 
ate forests and (ii) stronger for rare species than 
for common species in the tropics, but equivalent 
or weaker for rare species than for common species 
in temperate latitudes (7). The latter finding is 
supported by both observational and experimen- 
tal studies of tropical seedling growth and sur- 
vival as well as temperate sapling recruitment 
(13-15). Hülsmann and Hartig raise the common
misconception that strong negative density de- 
pendence for rare species should always increase 
their likelihood of local extinction. This is true if 
stabilizing forces such as CNDD are lacking in 
rare species. However, theory shows that strong 
CNDD can promote the persistence of rare species 
in communities, even when CNDD is relatively 
weaker for common species (16), as we observed 
in tropical forests (1). Under these conditions,
rare species that are strongly stabilized by in- 
teractions with specialized natural enemies or 
other conspecifics (i.e., strong CNDD) should be
less susceptible to local extinction from ecologi- 
cal drift than rare species that are not stabilized 
(1, 16). Our results are not only consistent with
this idea; they also suggest fundamental differences
in the nature of local stabilizing interactions that 
contribute to the maintenance of species diver- 
sity across temperate and tropical latitudes. 

LaManna et al. (Reports, 30 June 2017, p. 1389) found higher conspecific negative density
dependence in tree communities at lower latitudes, yielding a possible mechanistic
explanation for the latitudinal diversity gradient. We show that their results are artifacts
of a selective data transformation and a forced zero intercept in their fitted model.
A corrected analysis shows no latitudinal trend.


In ecological systems, conspecific negative
density dependence (CNDD) occurs when
individuals are more negatively affected by
neighboring conspecific individuals than by
neighboring heterospecific individuals (1-4).
Communities with stronger CNDD should, all 
else equal, have higher diversity, because CNDD 
prevents any one species from becoming too 
abundant. Accordingly, it is theoretically pos- 
sible that global variation in CNDD drives global 
variation in diversity (5). In particular, the latitudinal
diversity gradient, whereby diversity
decreases with increasing latitude, could be
driven by a concomitant weakening in CNDD 
with increasing latitude. Thus, there is scientific 
interest in testing the hypothesis that species in 
tropical high-diversity sites are more suscepti- 
ble to conspecific competition than are their 
temperate counterparts. 

LaManna et al. (6) tested this hypothesis by 
analyzing the relationship between sapling den- 
sity and adult density for nearly 2.4 million trees 
of more than 3000 species across 24 forest plots 
worldwide. For each species at each site, they
fitted the Ricker population dynamics model
(7) to static counts of adults and saplings with- 
in 10 m x 10 m or 20 m x 20 m quadrats. They 
found that the fitted CNDD parameter of the 
Ricker model was, on average, substantially more 
negative in high-diversity, low-latitude forests, 
providing apparent support for the hypothesis 
that CNDD explains high tropical tree diversity. 

However, the analysis of LaManna et al. has
two flaws. First, they transformed a subset of 
their data prior to analysis while leaving the 
remainder of the data untransformed, which 
biased the model results. For each species, they 
added +0.1 to the adult abundance in any quad- 
rat that had nonzero sapling abundance and 
zero adult abundance, but did not transform 
adult abundance values in other quadrats. The 
problem LaManna et al. were trying to address 
with this selective transformation was that the 
Ricker model has a zero intercept and there- 
fore cannot account for observations that have 
nonzero sapling abundance and zero adult abun- 
dance. However, the selective transformation 
they used is not valid (and renders goodness- 
of-fit statistics meaningless). Indeed, we cannot 
think of any circumstances under which it is 
valid to transform only a subset of data prior 
to analysis. 
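
The selective transformation described above can be made concrete with a small sketch. It is an illustration only: the function names and the toy quadrat counts are hypothetical, and the code is not the original authors' implementation. The uniform version is included purely for contrast.

```python
def apply_selective_offset(adults, saplings, offset=0.1):
    """Shift adult abundance only where saplings > 0 and adults == 0."""
    return [
        a + offset if (a == 0 and s > 0) else a
        for a, s in zip(adults, saplings)
    ]


def apply_uniform_offset(adults, offset=0.1):
    """Shift adult abundance in every quadrat by the same amount."""
    return [a + offset for a in adults]


# Toy counts for five quadrats of one species (hypothetical values).
adults = [0, 0, 2, 1, 0]
saplings = [3, 0, 1, 0, 1]
print(apply_selective_offset(adults, saplings))
print(apply_uniform_offset(adults))
```

In the selective version only the first and last quadrats change, so the transformed and untransformed observations no longer sit on a common scale.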

Fig. 1. The selective transformation and forced zero intercept used by
LaManna et al. inflate CNDD, especially for rare species. (A and B)
Sapling abundance versus adult abundance for two illustrative species at
the Barro Colorado Island plot in Panama, Chrysophyllum argenteum
(a rare species) and Desmopsis panamensis (a common species), across
quadrats [gray points, with sizes proportional to log(number of
quadrats + 1)]. The selective transformation applied by LaManna et al.
(red points) leads to a fitted model (red curves) that has an inflated
intrinsic growth rate and inflated CNDD relative to the case with no
selective transformation (blue curves). The bias is more acute for rarer
species (A). Even without the selective transformation, the Ricker model
(blue curves) provides a poor fit to the average trend (black lines)
because of the forced zero intercept; a Ricker model with an intercept
term characterizes the data more accurately (green curves;
F = 5372.5 for C. argenteum and F = 1800.1 for D. panamensis;
P < 0.001 in both cases).

How did the selective transformation used by
LaManna et al. introduce a specific bias toward
higher CNDD in tropical species-rich forests?
The selective transformation moved data points 
with nonzero sapling abundance and zero adult 
abundance off the vertical axis, which caused 
the fitted Ricker model to estimate an artifi- 
cially high intrinsic growth rate and consequently 
artificially strong negative density dependence to 
compensate (Fig. 1). This effect was more pro- 
nounced for rare species, which have a higher 
proportion of presence quadrats with nonzero 
sapling abundance and zero adult abundance, 
and thus a higher proportion of transformed 
data points (e.g., Fig. 1A; Spearman r = -0.25,
P=2x 10° for the relationship between species
abundance and fraction of presence quadrats
that were transformed). This then led to
more strongly biased CNDD estimates for rare 
species. Because species-rich sites have more rare 
species than species-poor sites, the fraction of data 
points affected by the selective transformation 
of LaManna et al. was positively correlated with 
site richness (Spearman r = 0.95, P = 2 x 10°°). 
For example, at the most species-poor site, Zofin, 
the average abundance is 212.7 trees/ha and 
LaManna et al. transformed 7% of presence 
quadrats, whereas at the most species-rich site, 
Khao Chong, the average abundance is 6.4 trees/ha 
and they transformed 50% of presence quadrats. 
The higher fraction of selectively transformed 
points at species-rich sites led to a stronger bias 
in CNDD estimates at these sites and ultimately 
inflated the latitudinal gradient in estimated 
CNDD. LaManna et al. also investigated shifts 
of +0.01 and +0.001 (their table S19), but these 
analyses suffer from the same rare-species bias. 
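
The two quantities discussed in this paragraph can be sketched in a few lines. The sketch assumes that a "presence quadrat" is any quadrat holding at least one adult or sapling of the species; that definition, the function names, and the toy counts are assumptions made for illustration and are not taken from either paper.

```python
def fraction_transformed(adults, saplings):
    """Share of presence quadrats that have saplings but no adults."""
    presence = [(a, s) for a, s in zip(adults, saplings) if a > 0 or s > 0]
    if not presence:
        return 0.0
    shifted = sum(1 for a, s in presence if a == 0 and s > 0)
    return shifted / len(presence)


def implied_per_capita(saplings, offset):
    """Saplings per adult implied once a zero-adult quadrat is shifted."""
    return saplings / offset


# Hypothetical rare and common species across four quadrats each.
rare = fraction_transformed([0, 0, 1, 0], [2, 1, 0, 0])
common = fraction_transformed([3, 2, 0, 4], [5, 1, 1, 2])
print(rare, common)

# Smaller offsets imply even larger per-capita recruitment in shifted quadrats.
for offset in (0.1, 0.01, 0.001):
    print(offset, implied_per_capita(3, offset))
```

In this toy case the rare species has the larger share of shifted quadrats, which is the direction of the relationship the text reports between species abundance and the fraction of transformed presence quadrats.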

The second problem in the analysis of
LaManna et al. was that the Ricker model
was forced to have a zero intercept, when the
data do not suggest this (e.g., Fig. 1) and the re- 
sulting residual plots are skewed (not shown by 
LaManna et al.). We fitted a Ricker model with 
a nonzero intercept to the same forest data 
and obtained better fits than with a zero inter- 
cept [as judged by F tests: P < 0.05 for 99.4% of 
species; P < 3 x 10 ° (Bonferroni-adjusted thresh- 
old) for 98.3% of species]. Statistically, the non- 
zero intercept is a valid solution (unlike the 
selective transformation of LaManna et al.) 
to the problem of quadrats with nonzero sapling 
abundance and zero adult abundance. Biolog- 
ically, the nonzero intercept for each species 
can be interpreted as the central tendency of a 
stochastic immigration process that does not 
vary systematically with the number of adults 
in a quadrat. With our improved model, we found 
that the median estimated CNDD over all species 
at all plots was lower by about a factor of 30 than 
in LaManna et al., the relationship between plot
richness and estimated CNDD disappeared, and 
the latitudinal pattern in estimated CNDD also 
disappeared (Fig. 2). 
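
The contrast between the two model forms, and the kind of nested-model F statistic used to compare them, can be sketched as follows. The sketch assumes the common Ricker form, saplings = adults * exp(r + CNDD * adults), and an additive constant for the intercept version. The exact parameterization and fitting procedure are not given in this text, so the function names and the toy residual sums of squares are hypothetical.

```python
import math


def ricker(adults, r, cndd):
    """Zero-intercept Ricker curve: predicts no saplings where adults == 0."""
    return adults * math.exp(r + cndd * adults)


def ricker_with_intercept(adults, r, cndd, intercept):
    """Same curve plus a constant term for saplings in adult-free quadrats."""
    return intercept + ricker(adults, r, cndd)


def nested_f_statistic(rss_reduced, rss_full, n_obs, p_reduced, p_full):
    """F statistic comparing a reduced model with a fuller nested model."""
    numerator = (rss_reduced - rss_full) / (p_full - p_reduced)
    denominator = rss_full / (n_obs - p_full)
    return numerator / denominator


print(ricker(0, r=1.0, cndd=-0.2))  # zero by construction
print(ricker_with_intercept(0, r=1.0, cndd=-0.2, intercept=0.5))
# Toy residual sums of squares for a 2-parameter and a 3-parameter fit.
print(nested_f_statistic(10.0, 5.0, n_obs=103, p_reduced=2, p_full=3))
```

The first function cannot represent quadrats with saplings but no adults, whereas the second can. The F statistic asks whether the extra parameter reduces the residual sum of squares by more than chance alone would suggest.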

LaManna et al. claimed that their results were 
robust because qualitatively similar patterns arose 
when they excluded quadrats with nonzero sap- 
ling abundance and zero adult abundance. But 
doing this remedied only the problem of se- 
lective transformation and not the problem of 
using a zero intercept. LaManna et al. also claimed 
that their results were robust because qualitatively 
similar patterns arose when they fitted a model 
with a different functional form: the “offset- 
power model” presented in their supplementary 
materials. However, the offset-power model also 
suffers from fatal statistical flaws. The offset-
power model of LaManna et al., as shown by
their own simulations, estimated strong CNDD
even when there was none (figure S11B in
LaManna et al. shows that the parameter is
estimated at less than 0.4 on average when the
true value is 1.0).

Fig. 2. Species richness exhibits no system-
atic relationship with the strength of CNDD
across tropical and temperate forests.
(A) Species richness versus estimated CNDD
(median over all species in a plot) for the
24 forest plots of LaManna et al. (points) with
fitted least-squares regression line. (B) Esti-
mated CNDD versus latitude. The methods and
data here are identical to those used to produce
the results of LaManna et al., except that we
have not applied a selective data transformation
prior to analysis and we have used an intercept
in the fitted Ricker model (see text and Fig. 1; the
graphs here should be directly compared with
figures 1E and 1C, respectively, in LaManna et al.).
Spearman rank correlation coefficients (r) with
P values are shown.

Overall, we found that the main results of 
LaManna et al. are artifacts of a selective trans- 
formation of the data prior to analysis and a 
forced zero intercept not suggested by the data. 
Our corrected analysis yields much weaker CNDD 
overall and no global CNDD gradient with species 
richness or latitude. We recommend that future 
studies test the robustness of our conclusion by 
using models that are fully spatial, dynamic, and 
stochastic. 

Chisholm and Fung claim that our method of estimating conspecific negative density
dependence (CNDD) in recruitment is systematically biased, and present an alternative 
method that shows no latitudinal pattern in CNDD. We demonstrate that their approach 
produces strongly biased estimates of CNDD, explaining why they do not detect a 
latitudinal pattern. We also address their methodological concerns using an alternative 
distance-weighted approach, which supports our original findings of a latitudinal gradient 
in CNDD and a latitudinal shift in the relationship between CNDD and species abundance. 


Chisholm and Fung (1) claim that our sta-
tistical approach (2) produces CNDD esti-
mates that are systematically biased, causing
rare species to always exhibit strong CNDD.
In particular, they argue that our CNDD
estimates for rare species are biased because 
an offset value was added to retain recruits (y > 0 
sapling density) located in 20 m x 20 m quadrats 
without a conspecific adult (x = 0 adult density).
Retention of all recruits was important because 
our data demonstrate that recruits in quadrats 
without adults likely came from adults in ad- 
jacent quadrats (3). To avoid bias that results 
from excluding these recruits (4), we therefore 
added a minimum adult density of 0.1 (offset 
value) to quadrats containing recruits but no 
conspecific adults. This approach allowed us 
to include all quadrats and meet the assumptions 
of a conventional Ricker model that fixes the in- 
tercept at the origin (5). This is what Chisholm and 
Fung refer to as “selective data transformation.” 
Three lines of evidence refute Chisholm and 
Fung’s claim. First, if their claim is correct, then 
our approach should have always produced strong 
estimates of CNDD for rare species regardless of 
their latitude. Yet our estimates of CNDD vary 
substantially among rare species when they are
matched for abundance [figure 2D in (2); see 
below]. Moreover, median CNDD for rare species 
differed across latitudes, with rare tropical species 
having stronger median CNDD than rare tem- 
perate species [figure 2C in (2)]. Second, add- 
ing the offset value to all quadrats does not 
qualitatively change either the relationship be- 
tween species rarefied richness and CNDD ac- 
ross latitudes (r = -0.877, P < 0.001) or the 
latitudinal shift in the relationship between 
CNDD and species abundance (r = -0.552, P =
0.006). Third, these findings persist using an 
alternative distance-weighted approach to esti- 
mate CNDD that avoids the use of an offset 
altogether (described in detail below). 

In contrast, Chisholm and Fung offer an al- 
ternative approach to retain all recruits by in- 
cluding an additional additive intercept in the 
Ricker model (Fig. 1). However, their modifi- 
cation of the Ricker model and inferences made 
from it are flawed for two reasons. First, the 
extra intercept in Chisholm and Fung’s model 
is confounded with the CNDD parameter (and 
recruitment parameter, or r) such that the same
reduction in recruitment with increasing adult
densities (i.e., CNDD) can be described by a wide
range of CNDD-parameter values (Fig. 1C). This 
problem is similar to another flawed attempt to 
add a third parameter to the Ricker model, which 
has been demonstrated to produce strongly biased 
estimates of CNDD (6). Second, the Ricker model 
already has an intercept that measures
density-independent recruitment (r) (Fig. 1B). The addition
of a second intercept removes a constant propor- 
tion of recruits from the calculation of the CNDD 
parameter in the Ricker model, thereby assuming 
that these recruits are immune to the influence of 
neighboring adults. This assumption is flawed 
because all recruits (even those that immigrate 
from outside the forest plot) are subject to poten- 
tial effects from intraspecific competition and 
shared natural enemies associated with neigh- 
boring adult trees (7, 8). Thus, the addition of an 
extra-intercept parameter suggested by Chisholm 
and Fung fundamentally alters the Ricker model 
so that it no longer measures the biological pro- 
cess of interest: CNDD. 

Benchmark tests, which quantify a model’s 
ability to recover known parameter values, fur- 
ther demonstrate that Chisholm and Fung’s 
extra-intercept Ricker model produces severely 
biased estimates of CNDD. Building on bench- 
mark tests provided in our original paper (2), 
we simulated data with known values of CNDD 
across the range of recruitment and mean adult 
abundances observed in the data while incor- 
porating uncertainty due to immigration and 
random error (2, 9). We then tested the ability 
of the extra-intercept model to recover these 
known CNDD values. CNDD estimates from the 
extra-intercept model were severely biased, im- 
precise, and centered at zero, regardless of the 
known CNDD value (Fig. 2, A and C). Indeed, 
Chisholm and Fung state that their approach 
resulted in CNDD being generally weak across 
species, which contradicts previous experimental 
demonstrations of strong CNDD for several 
tropical and temperate species in our study 
(10-12). This explains why they show no patterns 
in CNDD across species or latitudes. Moreover, 
estimated values of Chisholm and Fung’s extra- 
intercept parameter were not associated with 
known values of immigration in the benchmark 
tests (r = 0.010, P = 0.089), which suggests that
the extra intercept does not reflect immigration 
as they claim. 
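
The idea of a benchmark test, in which data are simulated with known parameter values and a model is then checked for its ability to recover them, can be shown with a deliberately simplified sketch. It assumes the Ricker form recruits = adults * exp(r + CNDD * adults) and fits it by ordinary least squares on log(recruits / adults). The benchmarks described above also include immigration and random error, which this sketch does not reproduce; the function names and parameter values are hypothetical.

```python
import math
import random


def simulate_recruits(adults, r, cndd, noise_sd=0.0, seed=0):
    """Recruits from an assumed Ricker form, with optional lognormal error."""
    rng = random.Random(seed)
    recruits = []
    for a in adults:
        error = rng.gauss(0.0, noise_sd) if noise_sd > 0 else 0.0
        recruits.append(a * math.exp(r + cndd * a + error))
    return recruits


def fit_ricker(adults, recruits):
    """Least-squares fit of log(recruits / adults) = r + cndd * adults."""
    # Requires strictly positive adult and recruit values.
    y = [math.log(s / a) for a, s in zip(adults, recruits)]
    n = len(adults)
    mean_x = sum(adults) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in adults)
    sxy = sum((a - mean_x) * (yi - mean_y) for a, yi in zip(adults, y))
    cndd_hat = sxy / sxx
    r_hat = mean_y - cndd_hat * mean_x
    return r_hat, cndd_hat


adults = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
recruits = simulate_recruits(adults, r=1.0, cndd=-0.3)  # known values
r_hat, cndd_hat = fit_ricker(adults, recruits)
print(round(r_hat, 6), round(cndd_hat, 6))
```

With no error the fit returns the known values; repeating the exercise with `noise_sd` above zero and many seeds gives a distribution of estimates whose center and spread indicate bias and precision.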

Inspired by Chisholm and Fung’s concern, we 
used an alternative approach to estimate CNDD 
that avoids the use of an offset. This approach 
yields relationships between CNDD and latitude, 
and between CNDD and species abundance, 
nearly identical to those in our original paper. 
This approach uses a distance-weighted measure 
of adult abundance for each quadrat, heavily 
weighting the number of conspecific adults in 
that quadrat but also incorporating (with lesser 
weights) the number of conspecific adults in 
neighboring quadrats (Fig. 3). To weight adult 
abundances, we used Clark’s 2dT dispersal kernel 
because it is well supported in previous studies of 
seed dispersal and reflects the probability that a 
sapling recruit in a focal quadrat came from an 
adult at a given distance (13, 14). This kernel
also reflects the decreasing influence with dis- 
tance that an adult may have on recruits in a 
focal quadrat via intraspecific competition or 
shared natural enemies (7, 8). We used a mean 
dispersal-distance parameter of 30 m for the 
2dT kernel, which is the mean dispersal distance 
estimated across more than 60 species in two of 
the largest studies of seed dispersal in tropical 
and temperate forests (13, 14) (Fig. 3A). The 
distance-weighted approach yields only non- 
zero values of adult abundance, reflecting the 
nonzero probability that recruits from any adult 
can potentially disperse to any quadrat in these 
4- to 50-ha forest plots, and thus negates the 
need for an offset value. Moreover, because there 
are only nonzero values of adult abundance (no
data at x = 0), this approach meets the assump-
tions of a conventional Ricker model that fixes
the intercept at the origin (5).

Fig. 1. Basic parameterization of the Ricker model and comparison
to the model proposed by Chisholm and Fung. [Panel labels: A, Ricker
model; B. Axis label: recruit density (R).] [...] model measures linear
changes in log-transformed [...] with increasing adult densities (5).
The intercept [...] independent per capita recruitment (r), and the
s[...] density-dependent recruitment (CNDD). (C) Com[...] model (red)
to the model suggested by Chisholm [...]
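
The distance-weighted measure of adult abundance described above can be sketched as follows. The sketch assumes the commonly cited 2Dt kernel form with shape parameter p = 1, for which the mean dispersal distance equals (pi / 2) * sqrt(u), and it rescales the weights so that an adult at zero distance counts as 1. The shape parameter, the rescaling, the function names, and the coordinates are all assumptions; this text does not give the authors' exact parameterization.

```python
import math


def weight_2dt(distance, mean_distance=30.0):
    """Relative 2Dt weight (shape p = 1), scaled to 1 at zero distance."""
    # For p = 1 the mean dispersal distance equals (pi / 2) * sqrt(u).
    u = (2.0 * mean_distance / math.pi) ** 2
    return (1.0 + distance ** 2 / u) ** -2


def distance_weighted_abundance(focal_xy, adult_xy, mean_distance=30.0):
    """Sum of kernel weights over all conspecific adults in the plot."""
    fx, fy = focal_xy
    return sum(
        weight_2dt(math.hypot(ax - fx, ay - fy), mean_distance)
        for ax, ay in adult_xy
    )


# Six adults in the quadrat next to quadrat 1; quadrat 2 is 200 m away.
adults = [(30.0, 10.0)] * 6
quadrat_1 = (10.0, 10.0)
quadrat_2 = (210.0, 10.0)
print(distance_weighted_abundance(quadrat_1, adults))
print(distance_weighted_abundance(quadrat_2, adults))
```

Both focal quadrats contain no adults, yet both receive a nonzero weighted abundance, and the quadrat adjacent to the six adults receives the larger value, so no offset is needed.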

Results using this distance-weighted approach 
support our original conclusions that CNDD is 
stronger in tropical than in temperate forests 
and that the relationship between CNDD and 
species abundance changes with latitude (Fig. 4). 
Null-model analyses conducted using the distance- 
weighted approach further demonstrate that 
these results are robust to the influence of other 
processes that might affect the spatial distri- 
bution of recruits relative to adults (e.g., habitat 
specificity, adult/recruit ratios, species abun- 
dances) (3). Moreover, benchmark tests that in- 
corporate uncertainty due to immigration and 
random error (described above) demonstrate 
that the distance-weighted approach accurately
recovers known values of CNDD (Fig. 2, B and 
D). In summary, regardless of whether we use 
(i) the distance-weighted approach (which requires 
no offset value), (ii) the Ricker model with an 
offset applied to quadrats with recruits but no 
adults (2), (iii) the Ricker model with an offset 
applied to all quadrats (described above), or 
(iv) an offset-power model (2), we find the same 
qualitative results as those presented in our 
original paper (2). 

In conclusion, we demonstrate that our key
findings are not an artifact of systematic bias. We
also demonstrate that the alternative approach
proposed by Chisholm and Fung fundamentally
alters the Ricker model so that it no longer mea-
sures CNDD accurately or precisely, explaining
why their method shows no latitudinal pattern in
CNDD. Finally, we show that our key findings,
namely that (i) tree species diversity increases with the
strength of CNDD across temperate and tropical forests
and (ii) there is a latitudinal shift in the relation-
ship between CNDD and species abundance,
are supported using a distance-weighted approach
that addresses the methodological concerns raised
by Chisholm and Fung.

Fig. 1 (continued). [...] adds an intercept, I, to the right side of the equation in (A). I is
mathematically confounded with a and b, similar to other attempts that
add a third parameter to the Ricker model (6), such that a wide range
of values of a and b can describe the same density dependence in
recruitment. Hence, the five overlapping blue curves shown in (C) can be
described by a wide range of a and b values. That is not the case with the
conventional Ricker model (red lines).

Fig. 2. Benchmark tests that attempt to recover known values of
CNDD from simulated data with error. (A and B) Tests of the ability of
the extra-intercept approach proposed by Chisholm and Fung (A) and the
distance-weighted adult abundance approach (B) to recover known values of
CNDD across a wide range of CNDD, r, mean adult abundance, random error,
and immigration values. An unbiased approach should produce estimates
that fall on the identity line (diagonal black dashed lines). The greater spread
of estimated values in (A) relative to (B) reflects the confounded nature of
the CNDD and extra-intercept parameters in the model proposed by
Chisholm and Fung. (C and D) One thousand iterations of the same
benchmark test at one set of known parameter values (i.e., known values in
red). Histograms show the distribution of estimated values from the Ricker
model using the extra-intercept approach (C) and the distance-weighted
approach (D). The distribution of estimated values should fall around the
known value if the model is not biased. The extra-intercept model tends to
bias CNDD toward weak values (i.e., closer to zero) and tends to bias r low. The extra-
intercept model also estimates a wide range of nonzero intercepts, even
though these simulated data were generated without an intercept.
[Axis labels: known CNDD value; density-independent recruitment.]

Fig. 3. Distance-weighted measure of adult abundance. (A) Distance
kernel used to weight adult abundances as a function of distance from a
focal quadrat. We used the mean dispersal distance (30 m, red curve)
across more than 60 tropical and temperate species estimated from
Clark's 2dT kernel (13, 14), but inferences were similar using more extreme
values [i.e., 10 m (blue curve), 60 m (orange curve)]. (B) The distance-
weighted approach is advantageous because it provides information on
adults in neighboring quadrats. Whereas both quadrats 1 and 2 have naive
adult abundances of zero, the distance-weighted adult abundance of
quadrat 1 would be greater than quadrat 2 because of the presence of six
adults in the neighboring quadrat. (C) Example of distance-weighted adult
abundance values (color scale) for Cassipourea elliptica at Barro Colorado
Island, Panama (range of distance-weighted adult abundance values is 0.003
to 10.48). Black dots are adult individuals of this species.

Fig. 4. Results using the Ricker model with distance-weighted mea-
sure of adult abundance. (A) Species rarefied richness increased with the
median strength of CNDD across tropical and temperate forest plots.
(B) Median CNDD was stronger for rare species in tropical than in temperate
forests. (C) Estimates of CNDD plotted against species abundance for all
species and forests (tropical species in red/orange; temperate species in
blue/purple; mid-latitude species in green). (D) The slope between CNDD and
species abundance decreased with increasing latitude. These results support
the main results in our original paper [figures 1 and 2 in (2)]. Colors reflect
distance from the equator [see captions of figures 1 and 2 in (2)]. Results
are shown at the 20 m x 20 m scale. Linear fits are shown, along with
Spearman rank correlation coefficients and their P values.

AAAS NEWS & NOTES


Global health shifts to local 
experts with global partners 


Local health specialists seek international 
collaborations to fight health emergencies 


By Anne Q. Hoy 


A decade and a half ago, severe acute respiratory syndrome (SARS) 
took hold in the coastal Chinese province once known as Canton. 
Within months, the respiratory virus had spread across four conti- 
nents. The U.S. Centers for Disease Control and Prevention issued 
its first travel warning. In time, 774 people had died and 8,098 were 
sickened by the outbreak. 

The response of governments and global public health organiza- 
tions to the world’s first known instance of atypical pneumonia has 
informed responses to more recent global health epidemics, includ- 
ing the Ebola outbreak in West Africa from 2014 to 2016 and the Zika 
virus in Brazil in 2015. 

At the center of the U.S. response to the outbreaks was Jimmy 
Kolker, now a visiting scholar at the American Association for the Ad- 
vancement of Science’s Center for Science Diplomacy. At the time, 
he was the assistant secretary for global affairs at the U.S. Health 
and Human Services Department, a position to which he brought 
experience as the former ambassador to Burkina Faso from 1999 
to 2002 and later to Uganda from 2002 to 2005. 

Kolker traced the evolution of global health on 4 May as part of the 
AAAS-Hitachi Lecture on Science and Society, at AAAS’s Washing- 
ton, D.C., headquarters. The annual lecture series is a decade-long 
collaboration between AAAS and Hitachi Ltd., the global technology 
and innovation company. 

As the U.S. ambassador to Uganda seeking to implement the 
President’s Emergency Plan for AIDS Relief, Kolker said diplomacy 
was “a crucial but ill-defined” requirement. Applying policies based 
on science and evidence in Uganda required weighing uncertainties 
with limited data, a topic Kolker explored in an essay recently pub- 
lished in Science & Diplomacy. 

Global health crises require international health institutions, 
governments, scientists, and diplomats to work together. Yet, this 
growing but still tiny field of health diplomacy increasingly needs sci- 
entists skilled in translating research into actionable policy options— 
skills AAAS has long worked to hone through multiple science policy,
engagement, and diplomacy programs designed to help scientists 
effectively place research findings into context to ensure that sci- 
ence informs policy. “The data rarely speaks for itself,” Kolker said. 

The interplay of science and diplomacy takes on many forms. The 
SARS epidemic in 2003, for instance, underscored the necessity of 
open public and international communications, a practice the Chinese 
government initially discounted. After SARS, the CDC embedded dis- 
ease surveillance and detection experts with Chinese counterparts. 

“The system that was set up, the protocols that were put in place 
meant that when the next potential pandemic, H7N9, broke out, the 
Chinese response was timely and enabled governments and experts 
around the world to bring to bear the knowledge needed to control 
the outbreak,” Kolker said. 

At the outset of the Zika outbreak in Brazil, the government hesi- 
tated to share samples necessary to develop blood bank screening, 
diagnostics, and vaccines. A U.S. team from the CDC, the National 
Institutes of Health, and other agencies responded to the Brazilian 
government's request for an experts’ meeting. Kolker, who led the 
group, said the meeting “changed the dynamic” and produced a plan 
that gave political approval for Brazil's top health institutions to deal 
directly with American counterparts to streamline cooperation on 
Zika research, countermeasures, and field studies. 

The unexpected 2014 Ebola outbreak in West Africa highlighted 
the benefit of an earlier collaboration between the CDC and Ugandan 
scientists dating from what was then the world’s largest Ebola out- 
break in 2000. The CDC helped Uganda develop its own world-class 
Ebola laboratory and response center at the Uganda Virus Research 
Institute. During the West African outbreak that killed thousands, a 
single case of Ebola in Uganda was quickly diagnosed, and contacts 
were traced and isolated. 

“There was one death in Uganda,” said Kolker. “The methods were 
in place to deal with the outbreak. We didn’t hear about it because 
Uganda had world-class capacity and did not require outside or 
emergency help.” 

Emerging economies, as the Ugandan outcome demonstrates, 
are not looking for “donors to provide aid;” instead, Kolker said, 
they want to work as partners with experts to help build First World 
capacity in their own health systems and institutions. 

Against this backdrop, U.S. funding for international health secu- 
rity is being squeezed, Kolker added, including programs designed 
to assist emerging economies to meet World Health Organization 
standards to prevent and respond to global health emergencies. 

The White House proposed deep cuts to global health programs in
its fiscal 2019 budget proposal, including programs that address
HIV/AIDS, malaria, immunizations, and parasitic diseases, said
David Parkes, program associate of AAAS's R&D Budget and Policy
Program. The president's budget plan would cut these CDC programs
alone by a total of $80 million, taking funding levels 16.3% below
enacted fiscal 2018 levels.

Hong Kong passengers don masks during the 2003 SARS outbreak.

The fiscal 2019 budget proposal also called for a 36% reduction 
below enacted fiscal 2018 levels for the U.S. Agency for International 
Development. It would cut 17% from a CDC infectious disease pro- 
gram that develops tools to stop diseases spread between animals 
and people, reported the Global Health Technologies Coalition, a 
group dedicated to advancing deadly disease treatments and diag- 
nostic tools. 

Budget constraints and global political trends heighten the need 
for organizations like AAAS, academic institutions, philanthropies, 
and the private sector to forge partnerships with emerging
economies, Kolker said. Technical partnerships are needed to further
expand health care infrastructure and medical expertise. Several
AAAS science diplomacy training and public engagement programs 
contribute to meeting this goal but will be hard put to fill gaps if 
government commitment is reduced, Kolker added. 

In discussing his experience with epidemics, Kolker explained 
how multidisciplinary and multinational approaches are especially 
beneficial as researchers in emerging economies increasingly want to 
select the health experts they partner with and the research they want 
to pursue. Such collaborations have led research teams to no longer 
focus exclusively on infectious diseases. Increasingly, they also study 
chronic diseases such as diabetes, heart disease, and cancer that 
present growing health burdens in their countries. 

“Our health research system has led U.S. institutions to use Afri- 
can institutions as research platforms. This underestimates the abil- 
ity of Third World institutions to actually identify their own priorities,” 
said Kolker. “In the 21st century, the new paradigm is that lower- and 
middle-income countries actually want partnerships with the world’s 
best experts to bring their own capacity to First World standards.” 


AAAS extends science in theological education program 


Science in the Seminaries enhances science education across the religious community 


By Anne Q. Hoy 


Building on broad interest generated by a three-year pilot project 
integrating science into theological education, the American Associa- 
tion for the Advancement of Science is now expanding the initiative to 
advance understanding of science and technology across the religious 
community to as many as 35 seminaries over the next 5 years. 

A set of seven seminaries in Michigan, Missouri, Indiana, Illinois,
and Wisconsin has been selected as the first of four groups to
participate in the expanded program over the next
18 months. Three additional groups of seminar- 
ies will be chosen to join the program in stages. 

“The hope is that the seminary students 
exposed to enriched classes will find science 
relevant and interesting to their vocations, and 
in the future, help them make science a posi- 
tive force for their congregations and favorably 
impact the everyday lives of a broad swath of 
Americans,” said Jennifer Wiseman, director 
of the AAAS Dialogue on Science, Ethics, and 
Religion (DoSER) program.

Fr. John Kartje, a trained astrophysicist,
rector, and president of Mundelein Seminary,
a Roman Catholic theological seminary in a
northern Chicago suburb, expressed hope
that his school’s participation in the expansion
phase will give students and faculty a “deeper
knowledge and better appreciation for the
findings and methodologies of scientific research.”

“As Mundelein Seminary prepares priests who will be serving in 
parishes across the country, it is important that they possess a basic 
foundation in the fundamentals of science and are up to date on cur- 
rent trends in research and discovery,” Kartje said. “Such ‘scientific 
fluency’ will help them better connect with their parishioners and 
better integrate their theological expertise into the scientifically 
savvy culture in which they will minister.” 

Anna Case-Winters, professor of theology at McCormick Theological
Seminary, said the chance to participate in the initiative drew
immediate faculty support from the U.S. Presbyterian theological
seminary located in Chicago. “We believe that for religious leaders
today, capacity for thoughtful interaction with the sciences is not
optional—it is essential,” Case-Winters said.


Screeners needed for
journalism awards


Scientists from the United States and abroad who will be in the
Washington, D.C., area between late August and late September
are needed to review the scientific accuracy of entries in the
prestigious AAAS Kavli Science Journalism Awards competition.
If you can volunteer, please contact Nkongho Beteck
(nbeteck@aaas.org) for screening dates and categories.

Participating seminaries have pledged to incorporate science into 
at least two of their core courses and to hold at least one campus-wide 
event over the next 18 months. “We provide science resources and 
they plug that into the larger context of their programs,” said Curtis 
Baxter, a DoSER program associate. “The semi- 
naries decide on their own how to incorporate 
the science into courses they already teach.” 

AAAS will recruit science advisers from 
nearby research and academic institutions to 
share knowledge and experience in designing 
engaging science coursework, assisting theo- 
logical educators at each seminary to sort out 
how best to integrate science into courses the 
seminaries have selected. The program makes 
available information on advances in science 
and technology and provides the institutions 
with access to the Science family of journals. 

Coursework that seminaries plan to fold 
into their core classes covers a broad sweep of 
science, ranging from evolution of the cosmos 
to genetics and neuroscience. One seminary 
plans to explore topics such as anxiety, addic- 
tion, and dementia to prepare church leaders 
to better minister to congregants. Some seminaries plan hands-on 
activities, including a pilgrimage to Israel to study archaeological 
sites and field trips to laboratories and science museums. 

The five other seminaries participating are the Seventh-Day 
Adventist Theological Seminary of Andrews University in Berrien 
Springs, Michigan; Nazarene Theological Seminary in Kansas City, 
Missouri; Bethany Theological Seminary in Richmond, Indiana; 
Kenrick-Glennon Seminary in St. Louis, Missouri; and the Sacred 
Heart Seminary and School of Theology in Hales Corners, Wisconsin. 
A freshwater turtle known as the
red-eared slider (Trachemys scripta elegans)


PALEOCLIMATE
Forcing the East Asian
summer monsoon


What factors have controlled the intensity of the East Asian
summer monsoon over the recent geological past? To answer
this key question requires a robust proxy for rainfall amounts.
Beck et al. measured the beryllium isotopic content of loess
from China, from which they reconstructed a 550,000-year-long
record of rainfall. Rainfall correlated with orbital precession
and global variations in ice volume. This finding suggests that
the monsoon is governed by low-latitude interhemispheric
gradients in solar radiation levels, rather than by
high-northern-latitude solar radiation levels as previously
suggested. —HJS


Science, this issue p. 877 


VALLEYTRONICS 
Tracking the 
spin-valley current 


Taking advantage of the electron’s spin and valley degrees of
freedom requires a method for generating currents of carriers
that have a particular spin or come from a particular valley
in the electronic structure. Jin et al. used a heterostructure
made out of adjacent layers of WSe2 and WS2 to create a
spin-valley diffusion current without applying an external
electric field. Instead, they used circularly polarized laser
light to initiate the diffusion and a second laser pulse to image
the propagation of the carriers. With long lifetimes and diffusion
lengths, the method may be of practical use in future
valleytronic devices. —JS

Science, this issue p. 893


Secondary structures of RNA condensates
Langdon et al., p. 922


COLD MOLECULE PHYSICS
Lighting the way to
molecules, one by one


When chemists run reactions, what they are really doing is
mixing up an enormous number of reacting partners and then
hoping that they collide productively. It is possible to
manipulate atoms more deliberately with a scanning tunneling
microscope tip, but the process is then confined to a surface.
Liu et al. directly manipulated individual atoms with light to
form single molecules in isolation (see the Perspective by
Narevicius). They used optical tweezers of two different colors
to selectively steer ultracold sodium (Na) and cesium (Cs)
atoms together. A subsequent optical excitation formed NaCs.
—JSY


Science, this issue p. 900;
see also p. 855


NEURODEVELOPMENT


Evolution of the brain


Just how related are reptilian and mammalian brains?
Tosches et al. used single-cell transcriptomics to study
turtle, lizard, mouse, and human brain samples. They assessed how
the mammalian six-layered cortex might be derived from the
reptilian three-layered cortex. Despite a lack of correspondence
between layers, mammalian astrocytes and adult neural stem cells
shared evolutionary origins. General classes of interneuron types
were represented across the evolutionary span, although subtypes
were species-specific. Pieces of the much-folded mammalian
hippocampus were represented as adjacent fields in the reptile
brains. —PJH


Science, this issue p. 881
Using bugs in the gut
to detect blood 


Bacteria are environmentally 
resilient and can be engineered 
to sense various biomolecules. 
Mimee et al. combined biosensor 
bacteria with a miniaturized wire- 
less readout capsule to produce a 
minimally invasive device capable 
of in vivo biosensing in harsh, 
difficult-to-access environments 
(see the Perspective by Gibson 
and Burgell). The device success- 
fully measured gastrointestinal 
bleeding in pigs. —SYM 

Science, this issue p. 915;
see also p. 856


A stimulating therapy
for diabetes 


In type 2 diabetes, insulin resis- 
tance leads to elevated blood 
glucose and increased risk of 
cardiovascular disorders. The 
brain participates in glucose 
metabolism, but whether and 
how modulation of brain activity 
affects systemic blood concen- 
trations of glucose is poorly 
understood. In diabetic and 
nondiabetic patients, ter Horst et 
al. found that dopamine release 
induced by deep brain electrical 
stimulation of the nucleus accum- 
bens decreased systemic glucose 
concentrations. Conversely, phar- 
macological systemic dopamine 
depletion reduced insulin-mediated
blood glucose uptake. —MM
Sci. Transl. Med. 10, eaar3752 (2018). 


A faster way to detect 
Zika in mosquitoes 


A major challenge in preventing mosquito-borne diseases
is providing rapid and affordable tests to identify infected
insects. Conventional techniques are often time-consuming and
too expensive when analyzing large numbers of mosquitoes.
Leveraging a technique known as near-infrared spectroscopy
(NIRS), Fernandes et al. tested for Zika virus in female
Aedes aegypti mosquitoes. NIRS distinguished infected from
uninfected mosquitoes with up to 99% accuracy 7 days after
infection. NIRS is faster by a factor of 18 and cheaper by
a factor of 110 than RT-qPCR (quantitative reverse
transcription polymerase chain reaction), a technique commonly
used for pathogen screening in mosquitoes. —PJB
Sci. Adv. 10.1126/sciadv.aat0496 (2018).


Bile acids and liver cancer 
Liver cancer is a leading cause 
of cancer-related deaths in the 
United States. The composition 
of the gut microbiome influences 
many human diseases, including 
liver inflammatory disorders. Ma 
et al. found that commensal gut 
bacteria can recruit the immune 
system to control the growth 
of liver tumors in mice (see the 
Perspective by Hartmann and 
Kronenberg). Clostridium species 
modified bile acids to signal liver 
sinusoidal endothelial cells to 
produce the chemokine CXCL16. 
This recruited natural killer T 
(NKT) immune cells to perform 
antitumor surveillance of the 
liver. Growth of both primary and 
metastatic cancer was reduced 
by NKT cell–driven killing. —PNK
Science, this issue p. 876; 
see also p. 858 


Follow the leader 
What role do social dynamics 
play in guiding collective migra- 
tions? Identifying such dynamics 
requires following individual 
animals across long migratory 
distances. Flack et al. used GPS 
tags to follow individual juvenile 
white storks on their southern 
migration (see the Perspective 
by Nevitt). Birds generally fell 
into two categories: leaders and 
followers. Leaders sought out 
areas of thermal uplift, flapped 
less in transit, and flew farther. 
Followers followed leaders into 
thermals but had different 
trajectories, exhibited greater 
flapping effort, and flew shorter 
total distances. —SNV 

Science, this issue p. 911;
see also p. 852


Premature graying
of hair is related
to immune function.


HAIR COLOR 


Edited by Sacha Vignieri 
and Jesse Smith 


The roots of gray hair 


Gray hair is an inevitable part of aging. Melanocytes are
the culprit cells that slow production of the color pig- 
ments called melanin, but how and why this occurs with 
age largely remains a mystery. Harris et al. make a link 
between the immune system and premature graying. 
They find that the protein MITF (microphthalmia-associated 
transcription factor), which controls melanocyte stem cell 
function, also works to trigger melanocyte immune responses. 
Interferons normally kickstart the immune response to 
viral and bacterial infection, but when MITF cannot regulate 
interferon, hair turns gray in mouse models. These findings 
may shed light on why chronic illness or certain autoimmune 
disorders can accelerate the graying process. —PNK 


PLOS Biol. 10.1371/journal.pbio.2003648 (2018). 


Aluminum’s breakup 
with fluoroalkenes 


Carbon-fluorine bonds are hard 
to break. As a result, remediation 
of fluorocarbon waste streams is 
an enduring challenge. Bakewell 
et al. explored C-F scission in a 
variety of fluorinated propene 
derivatives by using an unusual 
synthetic aluminum compound. 
The electron-rich compound, 
with Al in the +1 oxidation state, 
inserted into both olefinic and 
allylic C-F bonds to form Al(III) 
products that were characterized 
crystallographically. Theory
implicated two simultaneous
competing mechanisms, respectively
involving stereoretentive
direct oxidative addition and
stereoinverting intermediacy of a
metallocyclopropane. —JSY
Angew. Chem. Int. Ed. 10.1002/anie.201802321 (2018).


Better to transfer
than transfuse?


β-Thalassemia is a blood disease caused by mutations
in the β-globin gene. β-Globin is a subunit of hemoglobin,
an oxygen-carrying protein in red blood cells. Patients with
β-thalassemia are severely anemic and require lifelong
transfusions of red blood cells. In two clinical trials involving
a total of 22 patients, Thompson et al. tested a gene
therapy–based treatment that might eliminate the need
for repeated transfusions. They isolated hematopoietic stem cells
from the patients, used a lentiviral vector to transfer a
“normal” copy of the β-globin gene into the cells, and then
infused the modified cells back into the patients. After
26 months, the patients showed sustained expression of the
transgenic hemoglobin, and nearly 70% of them no longer
required transfusions. —PAK


New Engl. J. Med. 378, 1479 (2018).


A gene controlling
Wolbachia proliferation
in Nasonia wasps is
under positive selection.


How hosts can defeat selfish elements


Wolbachia bacterial infections are horizontally passed through the eggs of nematodes and
insects and can selfishly affect reproductive outcomes, resulting in an increased number of
female offspring. Because Wolbachia can affect reproduction, it has been eyed as a potential
gene drive system to eliminate disease-carrying vectors, such as mosquitoes. Crossing
species of Nasonia wasps that maintain differing levels of Wolbachia, Funkhouser-Jones et
al. mapped a gene named Wolbachia density suppressor that controls Wolbachia proliferation within
hosts. Further investigation suggested that this gene is under positive selection. This adds to our
understanding of the dynamics between hosts and selfish parasites such as Wolbachia and may
provide information of interest for the design of gene drive systems. —LMZ


Curr. Biol. 10.1016/j.cub.2018.04.010 (2018).


How drinking (alcohol) 
affects drinking (water) 
The hormone FGF21 (fibroblast 
growth factor 21) is a major reg- 
ulator of drinking in mice. FGF21 
is produced in the liver and links 
metabolic status to behavioral 
responses through actions in 
the brain. It has many effects on 
metabolism, including regulation
of preference for sugar
consumption. Song et al. found that
various stresses, such as alcohol 
consumption or a ketogenic 
diet, caused increased produc- 
tion of FGF21 and stimulation of 
water drinking. Although alcohol 
consumption has well-known 
acute effects on water balance 
through inhibition of antidiuretic 
hormone action, this work shows 
an important role for FGF21 in 
stimulating water drinking after 
alcohol consumption. FGF21 can 
even suppress preference for 
alcohol consumption in favor of 
pure water. —LBR 
Cell Metab. 10.1016/j.cmet.2018.04.001 (2018).
Something from nothing
Specialized enzymes often 
originate from the refinement 
of a promiscuous enzyme by 
evolution, rather than through 
the emergence of activity from 
an inactive protein. Chalcone 
isomerase (CHI) catalyzes 
a simple chemical reaction 
but emerged within a protein 
family whose other extant 
members have no known enzy- 
matic function. Kaltenbach 
et al. used phylogenetic 
reconstruction to predict the 
sequence of ancestors along 
the course of CHI evolution. 
The distant ancestor of the 
protein family had no CHI 
activity, despite present- 
ing catalytic residues within 
an active site-like pocket. 
A small number of peripheral
mutations could induce
CHI activity in the ancestral 
protein, unlocking the catalytic 
potential already present in the 
binding site. —MAF 

Nat. Chem. Biol. 10.1038/s41589-018-0042-3 (2018).


Simulating the future 
of our Universe 


The energy densities of mat- 
ter and dark energy in our 
Universe are approximately 
equal at the current epoch. 
Testing whether this is a 
coincidence requires running 
cosmological simulations into 
the future, something that 
has generally been avoided 
because only the past can be 
observed. Salcido et al. simu- 
lated the futures of universes 
with and without dark energy, 
assessing its impact on the 
total star formation within 
each universe. The presence of 
dark energy has surprisingly 
little effect, because most stars
have formed before it becomes 
dominant. Eighty-eight percent 
of all stars that will ever form 
in our Universe have done so 
already. The results suggest 
that the current epoch is not 
particularly special. —KTS 

Mon. Not. R. Astron. Soc. 10.1093/mnras/sty879 (2018).
INORGANIC CHEMISTRY
Transforming nitrogen 
without carbon 


How much carbon does it take to 
make nitric acid? The counterin- 
tuitive answer nowadays is quite 
a lot. Nitric acid is manufactured 
by ammonia oxidation, and all 
the hydrogen to make ammonia 
via the Haber-Bosch process 
comes from methane. That's 
without even accounting for
the fossil fuels burned to power
the process. Chen et al. review 
research prospects for more 
sustainable routes to nitrogen 
commodity chemicals, consider- 
ing developments in enzymatic, 
homogeneous, and hetero- 
geneous catalysis, as well as 
electrochemical, photochemical, 
and plasma-based approaches. 
—JSY 


Science, this issue p. 873 


SINGLE-CELL ANALYSIS 
Mapping the planarian 
transcriptome 


A cell type’s transcriptome 
defines the active genes that 
control its biology. Two groups 
used single-cell RNA sequenc- 
ing to define the transcriptomes 
for essentially all cell types of 
a complete animal, the regenerative
planarian Schmidtea
mediterranea. Because pluripotent
stem cells constantly
differentiate to rejuvenate any
part of the body of this species,
all developmental lineages
are active in adult animals.
Fincher et al. determined the
transcriptomes for most, if not 
all, planarian cell types, includ- 
ing some that were previously 
unknown. They also identified 
transition states and genes 
governing positional informa- 
tion. Plass et al. used single-cell 
transcriptomics and computa- 
tional algorithms to reconstruct 
a lineage tree capturing the 
developmental progressions 
from stem to differentiated cells. 
They could then predict gene 
programs that are specifically 
turned on and off along the tree,
and they used this approach to 
study how the cell types behaved 
during regeneration. These 
whole-animal transcriptome 
“atlases” are a powerful way to 
study metazoan biology. —BAP 
Science, this issue p. 874, p. 875


CATALYSIS 
Reduction can make 
cobalt act precious 


Enzymes rely on abundant met- 
als such as iron and nickel to 
manipulate hydrogen. Chemists, 
on the other hand, have largely 
had to rely on precious metals 
such as platinum and rhodium 
for the task. Friedfeld et al. now 
report a simple trick to make 
cobalt act more like rhodium. 
Reduction of Co(II) to Co(I)
by zinc reinforced binding of
phosphine ligands to the metal 
to facilitate its use in asymmetric 
hydrogenation of alkenes. The 
cobalt catalysts tolerated alcohol 
solvents, unlike their rhodium 
congeners, and could be applied 
to a 200-gram-scale reduction
at 0.08% loading. —JSY 


Science, this issue p. 888 


PHOTOVOLTAICS 
Poking a semiconductor 


Noncentrosymmetric crystal 
structure can lead to a peculiar 
kind of charge separation under 
illumination called the bulk 
photovoltaic (BPV) effect. Solar 
cells made of such materials, 
however, typically have low 
efficiency. Yang et al. expanded 
the class of materials capable 
of exhibiting the BPV effect
by making ordinarily
centrosymmetric materials, such
as SrTiO3 and TiO2, lose their
inversion symmetry. The authors 
accomplished this by applying a 
point force on the surface of the 
material. This induced a strain 
gradient and the loss of inversion 
symmetry, resulting in large 
photovoltaic currents under 
illumination. The mechanism, 
dubbed the flexo-photovoltaic 
effect, is expected to apply to
most semiconductors. —JS 
Science, this issue p. 904 


ORGANIC ELECTRONICS 
A longer exciton pathway 


Organic semiconductors 
typically exhibit exciton diffu- 
sion lengths on the order of 
tens of nanometers. Jin et al. 
prepared nanofibers from block 
polymers consisting of emissive 
polyfluorene cores surrounded 
by coronas of polyethylene 
glycol and polythiophene (see 
the Perspective by Holmes). 
Excitons generated in the 
polyfluorene cannot enter the 
polyethylene glycol layer and so 
diffuse more than 200 nm. This 
distance can be tuned by varying 
the length of the polyethylene 
glycol—a feature that could 
potentially be exploited in the 
development of organic devices 
such as photovoltaics. —MSL 
Science, this issue p. 897; 
see also p. 854 


MOLECULAR BIOLOGY 
RNA and membraneless 
organelles 


Membraneless compartments 
can form in cells through liquid- 
liquid phase separation (see the 
Perspective by Polymenidou). 
But what prevents these cellular 
condensates from randomly 
fusing together? Using the RNA- 
binding protein (RBP) Whi3, 
Langdon et al. demonstrated 
that the secondary structure
of different RNA components
determines the distinct biophysi- 
cal and biological properties of 
the two types of condensates 
that Whi3 forms. Several RBPs, 
such as FUS and TDP43, contain 
prion-like domains and are 
linked to neurodegenerative 
diseases. These RBPs are usually 
soluble in the nucleus but can 
form pathological aggregates in 
the cytoplasm. Maharana et al. 
showed that local RNA con- 
centrations determine distinct 
phase separation behaviors in 
different subcellular locations.
The higher RNA concentrations 
in the nucleus act as a buffer 
to prevent phase separation of 
RBPs; when mislocalized to the 
cytoplasm, lower RNA concen- 
trations trigger aggregation. 
—SYM 

Science, this issue p. 922, p. 918;
see also p. 859


CANCER IMMUNOLOGY 
Killing without 
poking holes 


Given the success of T cell- 
centric cancer immunothera- 
pies, there is considerable 
interest in understanding exactly 
how tumors sometimes evade 
this form of treatment. Kearney 
et al. carried out a series of 
genome-wide CRISPR screens 
to identify mechanisms of 
tumor immune evasion from 
cytotoxic T lymphocytes (CTLs) 
and natural killer (NK) cells. 
Interferon-γ signaling and antigen
presentation were critical for
CTL-mediated killing of cancer 
cells, and TNF (tumor necrosis 
factor) signaling was a key effec- 
tor mechanism for both CTL and 
NK cell antitumor activity. The 
same immune evasion mecha- 
nisms arose upon screening with 
perforin-deficient CTLs, sug- 
gesting that tumors evade the 
immune system by dampening 
the effects of cytokines, rather 
than reducing direct killing via 
perforin. —AB 

Sci. Immunol. 3, eaar3451 (2018). 


EVOLUTION 
Multiple parasites speed 
host evolution 


Virtually all organisms are 
parasitized by multiple species, 
but our current understanding 
of host-parasite interactions
is based on pairwise species
interactions. Betts et al. address
this by using the bacterium
Pseudomonas aeruginosa
and five different phage virus
parasites. Increasing parasite
diversity accelerated the rate
of host evolution, driving both 
faster genomic evolution within 
populations and greater diver- 
gence between populations. 
Thus, different parasite loads 
prompt different evolution- 
ary dynamics and profoundly 
shape host evolution by different 
mechanisms. —CA 

Science, this issue p. 907 


CAR T CELL SIGNALING 
More ITAMs for more 
potent receptors 


A form of cancer immuno- 
therapy uses the patient’s own 
T cells, which are engineered 
to express a chimeric antigen 
receptor (CAR) that recognizes 
a cancer cell antigen. Increasing
signal transduction efficiency 
in these CAR T cells would 
enhance treatment efficacy. 
James engineered synthetic 
T cell receptors with differing 
numbers of ITAMs, a protein 
motif involved in immune cell 
signaling. Increasing ITAM 
number enhanced the fraction 
of T cells that became activated 
by antigen, suggesting a strategy 
to improve the potency of CAR T 
cells. —ERW

Sci. Signal. 11, eaan1088 (2018). 
INORGANIC CHEMISTRY


Beyond fossil fuel-driven 
nitrogen transformations 


Jingguang G. Chen,* Richard M. Crooks,* Lance C. Seefeldt,* Kara L. Bren,
R. Morris Bullock, Marcetta Y. Darensbourg, Patrick L. Holland, Brian Hoffman,
Michael J. Janik, Anne K. Jones, Mercouri G. Kanatzidis, Paul King, Kyle M. Lancaster,
Sergei V. Lymar, Peter Pfromm, William F. Schneider, Richard R. Schrock


BACKGROUND: The invention of the Haber- 
Bosch (H-B) process in the early 1900s to 
produce ammonia industrially from nitrogen 
and hydrogen revolutionized the manufacture 
of fertilizer and led to fundamental changes 
in the way food is produced. Its impact is 
underscored by the fact that about 50% of 
the nitrogen atoms in humans today originate 
from this single industrial process. In the cen- 
tury after the H-B process was invented, the 
chemistry of carbon moved to center stage, 
resulting in remarkable discoveries and a vast 
array of products including plastics and phar- 
maceuticals. In contrast, little has changed in 
industrial nitrogen chemistry. This scenario 
reflects both the inherent efficiency of the H-B 
process and the particular challenge of breaking 
the strong dinitrogen bond. Nonetheless, the 
reliance of the H-B process on fossil fuels and 
its associated high CO2 emissions have spurred
recent interest in finding more sustainable and
environmentally benign alternatives. Nitrogen
in its more oxidized forms is also industrially,
biologically, and environmentally important,
and synergies in new combinations of oxidative
and reductive transformations across
the nitrogen cycle could lead to improved
efficiencies.


Figure panel label: Nitrogenase enzymes

Possible routes for nitrogen transformations that eliminate
or minimize the need for fossil fuels. A more thorough
understanding of nitrogenase may lead to more efficient
homogeneous catalysts for reduction of N2 to NH3.
Coupling of theory and experiment will lead to
more effective and stable heterogeneous and
electrocatalysts. Innovative energy sources, such as
plasmas, which involve nonequilibrium chemistry,
may lead to new nitrogen conversion mechanisms.


Chen et al., Science 360, 873 (2018) 25 May 2018


ADVANCES: Major effort has been devoted to 
developing alternative and environmentally 
friendly processes that would allow NH3 pro- 
duction at distributed sources under more 
benign conditions, rather than through the 
large-scale centralized H-B process. Hydro- 
carbons (particularly methane) and water are 
the only two sources of hydrogen atoms that 
can sustain long-term, large-scale NH3 production.
The use of water as the hydrogen source
for NH3 production requires substantially more
energy than using methane, but it is also more
environmentally benign, does not contribute to
the accumulation of greenhouse gases, and does
not compete for valuable and limited hydrocarbon
resources. Microbes living in all major
ecosystems are able to reduce N2 to NH3 by
using the enzyme nitrogenase. A deeper under- 
standing of this enzyme could lead to more 
efficient catalysts for nitrogen reduction under 
ambient conditions. Model molecular catalysts 
have been designed that mimic some of the 
functions of the active site of nitrogenase. Some 
modest success has also been achieved in design- 
ing electrocatalysts for dinitrogen reduction. 
Electrochemistry avoids the expense and envi- 
ronmental damage of steam reforming of me- 
thane (which accounts for most of the cost of 
the H-B process), and it may provide a means 
for distributed production of ammonia. On 
the oxidative side, nitric acid is the principal
commodity chemical containing oxidized nitrogen.
Nearly all nitric acid is manufactured by
oxidation of NH3 through the Ostwald process, but
more direct reaction of N2 with O2 might be
practically feasible through further development of
nonthermal plasma technology. Heterogeneous NH3
oxidation with O2 is at the heart of the Ostwald
process and is practiced in a variety of environmental
protection applications as well. Precious metals
remain the workhorse catalysts, and opportunities
therefore exist to develop lower-cost
materials with equivalent or better activity and
selectivity. Nitrogen oxides are also environmentally
hazardous pollutants generated by
industrial and transportation activities, and
extensive research has gone into developing and
applying reduction catalysts. Three-way catalytic
converters are operating on hundreds of millions
of vehicles worldwide. However, increasingly
stringent emissions regulations, coupled with
the low exhaust temperatures of high-efficiency
engines, present challenges for future combustion
emissions control. Bacterial denitrification
is the natural analog of this chemistry and
another source of study and inspiration for
catalyst design.

Read the full article at http://dx.doi.org/10.1126/science.aar6611


OUTLOOK: Demands for greater energy effi- 
ciency, smaller-scale and more flexible pro- 
cesses, and environmental protection provide 
growing impetus for expanding the scope of 
nitrogen chemistry. Nitrogenase, as well as 
nitrifying and denitrifying enzymes, will even- 
tually be understood in sufficient detail that 
robust molecular catalytic mimics will emerge. 
Electrochemical and photochemical methods 
also demand more study. Other intriguing areas 
of research that have provided tantalizing 
results include chemical looping and plasma- 
driven processes. The grand challenge in the 
field of nitrogen chemistry is the development 
of catalysts and processes that provide simple, 
low-energy routes to the manipulation of the 
redox states of nitrogen. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: jgchen@columbia.edu (J.G.C.); 
INORGANIC CHEMISTRY


Beyond fossil fuel-driven 
nitrogen transformations 


Jingguang G. Chen,* Richard M. Crooks,* Lance C. Seefeldt,* Kara L. Bren,
R. Morris Bullock, Marcetta Y. Darensbourg, Patrick L. Holland, Brian Hoffman,
Michael J. Janik, Anne K. Jones, Mercouri G. Kanatzidis, Paul King,
Kyle M. Lancaster, Sergei V. Lymar, Peter Pfromm,
William F. Schneider, Richard R. Schrock


Nitrogen is fundamental to all of life and many industrial processes. The interchange of 
nitrogen oxidation states in the industrial production of ammonia, nitric acid, and other 
commodity chemicals is largely powered by fossil fuels. A key goal of contemporary 
research in the field of nitrogen chemistry is to minimize the use of fossil fuels by 
developing more efficient heterogeneous, homogeneous, photo-, and electrocatalytic 
processes or by adapting the enzymatic processes underlying the natural nitrogen cycle. 
These approaches, as well as the challenges involved, are discussed in this Review. 


Reduced forms of nitrogen, particularly ammonia
(NH3), are vital to life. Before the
early 1900s, all reduced N came from biological
nitrogen fixation by microbes. This
landscape changed in the early 1900s with
the invention by Haber and Bosch of an industrial
process to reduce N2 to NH3 (N2 + 3H2 ⇌
2NH3). The Haber-Bosch (H-B) process has led
to fundamental changes in the way food is produced,
and its impact is underscored by the fact
that about 50% of the N atoms in humans today
originate from it (1). For these reasons and others,
the H-B process has been called the most impactful
invention of the 20th century (2). However,
it is energy-intensive, consuming 1 to 2%
of the world's annual energy output (3). The main
energy requirements arise from the high reaction
temperatures (~700 K) and pressures (~100 atm)
and, most importantly, from the need for large
quantities of H2. Nevertheless, at scale, the H-B
process is surprisingly energy-efficient. In addition
to the high energy demand, however, there
is a second problem with the process that is potentially
more serious. Specifically, the source of
H2 is usually natural gas (4), and, consequently,
the H-B process generates about 1.9 metric tons
of CO2 per metric ton of NH3 produced (3 CO2
per 8 NH3) (5).
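
One way to read the parenthetical "3 CO2 per 8 NH3" is as the molar ratio for hydrogen derived entirely from methane; that reading is an assumption here, not something the text states. Under it, the short sketch below converts the molar ratio to a mass ratio using standard molar masses (textbook values, not taken from the article) so that it can be compared with the quoted 1.9 metric tons.

```python
# Standard molar masses in g/mol (textbook values, not from the article).
M_CO2 = 44.009
M_NH3 = 17.031

def co2_mass_per_nh3_mass(mol_co2=3, mol_nh3=8):
    """Mass of CO2 per unit mass of NH3 for a given molar ratio."""
    return (mol_co2 * M_CO2) / (mol_nh3 * M_NH3)

ratio = co2_mass_per_nh3_mass()
quoted = 1.9  # metric tons CO2 per metric ton NH3, as quoted in the text

print(f"Mass ratio for 3 CO2 : 8 NH3 = {ratio:.2f} t CO2 per t NH3")
print(f"Quoted value is {quoted / ratio:.1f} times this ratio")
```

If this reading is right, the molar ratio corresponds to slightly under one metric ton of CO2 per metric ton of NH3, so the quoted figure would also include emissions beyond the feedstock stoichiometry.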

Nitrogen in its more oxidized forms is also
industrially, biologically, and environmentally
important. Oxidation of N2 to nitric oxide (NO)
and nitrogen dioxide (NO2), which are collectively
referred to as NOx (the representation of
neutral forms of oxidized nitrogen), occurs naturally
in lightning and also during combustion of
fuels in air (6). In fact, plasma-driven oxidation
of N2 to NO (the Birkeland-Eyde, or B-E, process)
preceded the H-B process as the first commercial
approach to nitrogen fixation (7). However, the
B-E process is not economically competitive with
the H-B process, and, as a result, the demand for
oxidized nitrogen, primarily in the form of nitric
acid (HNO3), is satisfied by catalytic oxidation
of H-B-generated NH3 at elevated temperatures
through the Ostwald process (8).

Nitrogen oxides (NOy, which represents both
neutral and anionic forms of oxidized nitrogen)
are also produced on a large scale by bacteria.
These processes include oxidation of NH3 to
nitrate (NO3−) by nitrifying bacteria and reduction
of NOy to nitrite (NO2−), NO, nitrous oxide
(N2O), and ultimately N2 by denitrifying bacteria.
The conversion of NH3 and NO3− to N2 by the
action of these bacteria is used in wastewater
treatment and is the primary pathway for loss of
applied fertilizers in agriculture. Nitrogen oxides
are common environmental pollutants, and thus
catalysts to reduce these oxides to N2 are also of
considerable practical importance. For example,
NOx generated during combustion is a primary
component of photochemical smog (9). The development
of active catalysts for reducing NOx
to N2 (three-way catalytic converters, or TWCs)
led to considerable improvements in air quality
in the late 1970s (10), and TWCs now operate on
hundreds of millions of vehicles worldwide. However,
increasingly stringent emissions regulations,
coupled with the low exhaust temperatures from
high-efficiency engines, present challenges for the
future control of combustion emissions (11). The
soluble forms of the nitrogen oxides, primarily
NO2− and NO3−, are similarly hazardous to human
health and contribute to the eutrophication
of waterways (12). Sustainable approaches
for catalytically converting these pollutants to N2
remain an unmet challenge (13).

In this Review, we report on the results of a 
U.S. Department of Energy workshop held in 
October 2016 that focused on the challenges 
and opportunities associated with fundamen- 
tal aspects of nitrogen chemistry. We begin by 
discussing the thermochemistry of nitrogen trans- 
formations; we then focus on the reduced forms 
of nitrogen, primarily NH3, followed by the oxidized
forms.


Thermochemistry of nitrogen fixation 


The energy expenditures for various reductive
and oxidative N2 fixation pathways are compared
in Fig. 1 (14-16). Hydrocarbons, including biomass
and fossil fuels (particularly methane), and
water are the only two sources of hydrogen atoms
that can sustain long-term, large-scale NH3 production.
The use of water as the hydrogen source
for NH3 production requires substantially more
energy than using methane (by 613.4 kJ/mol N2),
but it is also more environmentally benign, does
not contribute to the accumulation of greenhouse
gases, and does not compete for valuable and
limited hydrocarbon resources. However, these
advantages can be fully realized only if the energy
comes from a clean, renewable source such
as the Sun or wind.

When water is used as the feedstock, there 
is only a small (32.9 kJ/mol N2) thermodynamic 
advantage for the direct synthesis of NH3, com- 
pared with a two-stage synthesis involving water 
splitting followed by the H-B process. The overall 
energy saving may, however, become more sub- 
stantial if the H-B process is avoided altogether 
and direct reductive N2 fixation is conducted
under milder conditions. 

The two most important N-containing commodity
chemicals are NH3 and HNO3. In 2017,
their worldwide production was estimated to be
150 (17) and 50 (18) million metric tons, respectively.
Presently, nearly all HNO3 is manufactured
using a three-stage approach (steam reforming of
methane, the H-B process, and then the Ostwald
process; Fig. 1), but this approach is energetically
wasteful compared with direct oxidative N2
fixation.

An attractive alternative route to HNO3, which
avoids the NH3 intermediate, is direct N2 oxidation
to aqueous HNO3 (Eq. 1).

$$\mathrm{N_2(g) + 2.5\,O_2(g) + H_2O(liq) \rightleftharpoons 2\,H^+(aq) + 2\,NO_3^-(aq)} \qquad (1)$$


The equilibrium constant for this reaction is
~2.7 × 10^-3 M^4/bar^3.5 (standard Gibbs free energy
ΔG° = 14.6 kJ/mol N2) (14-16, 19), which is
large enough to drive spontaneous formation of
~0.1 M HNO3 in any pool of water on the surface
of Earth. Moreover, if equilibrium were to be
achieved between Earth's atmosphere and oceans,
the oceans would contain ~0.02 M HNO3, and
~90% of atmospheric O2 would be consumed.
Fortunately for life on Earth, under ordinary
conditions and in the absence of a catalyst, Eq. 1 is
unmeasurably slow in both directions.
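
The link between the quoted free energy, the equilibrium constant, and the ~0.1 M figure can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes 298.15 K, ideal behavior (activities equal to concentrations and partial pressures), and air at 1 bar containing 0.78 bar N2 and 0.21 bar O2; none of these assumptions is stated in the text.

```python
import math

R = 8.314462618   # gas constant, J/(mol K)
T = 298.15        # assumed temperature, K
DELTA_G = 14.6e3  # J per mol N2, the value quoted in the text for Eq. 1

# Assumed partial pressures of N2 and O2 in air at 1 bar (not from the article).
P_N2, P_O2 = 0.78, 0.21

def equilibrium_constant(delta_g, temperature=T):
    """K = exp(-dG/RT) for the reaction as written in Eq. 1."""
    return math.exp(-delta_g / (R * temperature))

def equilibrium_hno3_molarity(k, p_n2=P_N2, p_o2=P_O2):
    """Solve K = c^4 / (p_N2 * p_O2^2.5) for c = [H+] = [NO3-]."""
    return (k * p_n2 * p_o2 ** 2.5) ** 0.25

K = equilibrium_constant(DELTA_G)
c = equilibrium_hno3_molarity(K)
print(f"K = {K:.2e} M^4/bar^3.5")
print(f"Equilibrium HNO3 concentration = {c:.2f} M")
```

Under these assumptions the constant comes out near 2.8 × 10^-3 and the acid concentration near 0.08 M, consistent with the rounded values in the text. The fourth root is what makes a small equilibrium constant compatible with an appreciable concentration.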


Reduction of N2 to NH3
Overview 


As discussed in the previous two sections, the
most important N2 reduction reaction is NH3
synthesis, specifically N2 + 3H2 ⇌ 2NH3. Given
that this reaction is slightly thermodynamically
favorable under ambient conditions, major effort
has been devoted to developing alternative and
environmentally friendly processes that would
allow NH3 to be produced at distributed sources
under more benign conditions, rather than through
the large-scale centralized H-B process. This goal
is particularly important in developing countries
where the population, and hence the need for
food, is increasing rapidly. In these countries,
access to fertilizer is hampered by poor transportation
infrastructure and insufficient capital
to build large chemical plants. For example, there
are no large-scale NH3 production facilities anywhere
in East Africa. To address this need, substantial
effort has been expended to understand
N2 reduction using a variety of catalysts, including
heterogeneous, enzymatic, and homogeneous
catalysts, as well as electro- and photocatalysts.
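
The statement that NH3 synthesis is slightly favorable under ambient conditions, yet is run industrially at high pressure, can be illustrated with a rough estimate. The sketch below assumes textbook gas-phase values for the reaction enthalpy and entropy (not taken from the article) and treats both as independent of temperature, which is only approximate.

```python
import math

R = 8.314462618    # gas constant, J/(mol K)
DELTA_H = -91.8e3  # J per mol N2 for N2 + 3H2 -> 2NH3(g); assumed textbook value
DELTA_S = -198.1   # J/(K mol N2); assumed textbook value

def delta_g(temperature_k):
    """Standard Gibbs free energy of reaction, assuming constant dH and dS."""
    return DELTA_H - temperature_k * DELTA_S

def equilibrium_constant(temperature_k):
    return math.exp(-delta_g(temperature_k) / (R * temperature_k))

for temperature in (298.15, 700.0):
    print(f"T = {temperature:6.1f} K: dG = {delta_g(temperature) / 1e3:6.1f} kJ/mol N2, "
          f"K = {equilibrium_constant(temperature):.1e}")
```

With these inputs the reaction is favorable at room temperature but unfavorable at about 700 K, which is one way to see why the H-B process needs high pressure to reach useful conversions at the temperatures required for acceptable rates.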


Heterogeneous catalysis 


The commercial H-B process is carried out using
a heterogeneous catalyst based on iron and promoted
with Al2O3 and potassium. It is now well
accepted that the relative activity of metallic catalysts
can be correlated to their binding energies
with N-containing species in terms of a volcano-shaped
relationship. Metals that bind nitrogen
too strongly or too weakly are on either side of
the volcano (20). For metals that have low binding
energies, N2 dissociation is rate-limiting. For
metals with high binding energies, N2 dissociation
occurs, but desorption of the resulting atomic
N (and other N-containing intermediates) is slow,
limiting the number of available binding sites and
thereby slowing the catalytic rate (21).
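
The volcano argument can be made concrete with a deliberately simple toy model. Everything numerical in the sketch below is hypothetical: the linear scaling slopes and intercepts, the prefactor, and the rule that the overall rate equals the slower of two Arrhenius steps are chosen only to reproduce the qualitative shape, and are not taken from the article or from (20-22).

```python
import math

KB_EV = 8.617333262e-5  # Boltzmann constant, eV/K

# Hypothetical linear scaling parameters (eV), chosen only to produce a volcano.
DISSOCIATION_INTERCEPT, DISSOCIATION_SLOPE = 1.0, 0.9
DESORPTION_INTERCEPT, DESORPTION_SLOPE = 0.6, -0.9

def toy_rate(binding_energy_ev, temperature_k=700.0, prefactor=1e13):
    """Rate limited by the slower of N2 dissociation and product desorption.

    binding_energy_ev: N binding energy; more negative means stronger binding.
    """
    ea_dissociation = max(0.0, DISSOCIATION_INTERCEPT + DISSOCIATION_SLOPE * binding_energy_ev)
    ea_desorption = max(0.0, DESORPTION_INTERCEPT + DESORPTION_SLOPE * binding_energy_ev)
    kt = KB_EV * temperature_k
    k_dissociation = prefactor * math.exp(-ea_dissociation / kt)
    k_desorption = prefactor * math.exp(-ea_desorption / kt)
    return min(k_dissociation, k_desorption)

# Scan binding energies from -1.5 to 0.5 eV in 0.05 eV steps.
energies = [i / 20 for i in range(-30, 11)]
best = max(energies, key=toy_rate)
print(f"Toy optimum binding energy: {best:.2f} eV")
```

Strong binders sit on the desorption-limited side and weak binders on the dissociation-limited side. Because both barriers depend on the same descriptor, the peak rate is capped, which is the limitation that the scaling relations impose.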

Recent calculations suggest that the energies
of all intermediates and transition states involved
in a conventional NH3 synthesis reaction network
scale with the N-binding energy. As shown in
Fig. 2, these correlations limit the rate of NH3
synthesis on transition metals to much lower
values than would be possible on a material with
both a low N2 dissociation barrier and more moderate
binding energies (22). An important chal-
Fig. 1. Atom and energy economy of nitrogen fixation. The numerical values are the standard (14)
Gibbs free energies in kilojoules per mole of fixed N2 in the direction of the arrows. All thermochemical
data are from (15), except for H2NOH (16).


which should lead to the development of cata- 
lysts yielding reaction rates that are potentially 
orders of magnitude higher than the current state 
of the art. 

Advances in the synthesis, characterization, 
and modeling of nontraditional heterogeneous 
catalysts—such as intermetallics, alloys contain- 
ing a single heteroelement, and shape-controlled 
materials—offer the promise for more active 
catalysts. These combined efforts should lead 
to catalysts that could potentially operate at 
lower temperatures and thus reduce the op- 
erating pressures of the H-B process. Another 
opportunity to reduce the CO2 footprint of the
H-B process is to replace the methane reformer
with a water electrolyzer to provide CO2-free H2,
but this would require cost-effective, large-scale
electrolyzers.


Enzyme catalysis 


A range of microbes (bacteria and archaea) living
in all major ecosystems are able to reduce N2 to
NH3 using the enzyme nitrogenase. Nitrogen
fixation by these microbes contributes to the
global supply of NH3, but it is insufficient to
support modern intensive agriculture. Exciting
frontier areas for the application of microbial
N2 fixation include engineering nitrogen fixation
genes into eukaryotes (23), such as plants, and
using nitrogen-fixing microbes in electrocatalysis
cells that permit sustained N2 reduction to NH3
(24). Such systems offer the promise of providing
localized solutions to N2 reduction but are not
likely to displace the need for large-scale N2 reduction
in the near term.

One important lesson to learn from biological 
nitrogen fixation is how the enzyme nitrogenase 
accomplishes reduction of N2 to NH3 at ambient
temperatures and pressures and without H2.
The first nitrogenase to be purified (and the most 
widely distributed) is molybdenum-dependent 
and, as shown in Fig. 3, has a metal-cluster active 
site containing Mo, Fe, S, C, and homocitrate 
(FeMo cofactor). 

An important step toward a mechanistic understanding
of nitrogenase emerged from trapping
an intermediate with 4H+ and 4e− accumulated
immediately before binding N2. Characterization
of this intermediate (25) (termed the E4 state)
has revealed the presence of bridging hydrides
(Fe-H-Fe). The E4 state undergoes reductive elimination
of the two hydrides with release of H2,
with concomitant activation of the metal core by
2e− for binding and reduction of N2 to a metal-bound
diazenido species (HN=N-Fe). Although
these studies have provided insights into the
central catalytic step, which is the cleavage of the
N2 triple bond, many challenges remain before
the entire mechanism can be elucidated. These
include a molecular-level understanding of both
the early and late stages of the N2 reduction pathways,
an understanding of the roles of adenosine
triphosphate (ATP) hydrolysis in driving electron
transfer, and then reconciliation of the empirical
mechanism with computational studies.

These foregoing challenges primarily involve 
substrate moieties bound as catalytic intermedi- 
ates. In parallel, important questions remain about 
the FeMo cofactor active site of Mo-dependent 
nitrogenase. For example, what changes in redox 
and spin states of the seven Fe centers and the 
Mo center accompany the stepwise transforma- 
tions of substrates? Additionally, the FeMo co- 
factor contains the first known example of a 
carbide (C4−) in biology. The importance and
function of this carbide with respect to mecha- 
nism are unknown. Understanding the structure 
of FeMo cofactor intermediates and synthetic 
analogs could provide important clues to guide 
the design of new homogeneous and heterogeneous
catalysts for N2 reduction (26). Biomimetic
approaches could take advantage of not only 
the coordination chemistry in the FeMo cofactor, 
but also the cooperative interactions with nearby 
acidic amino acid residues in the nitrogenase 
protein. 

Electrochemical techniques offer a powerful 
alternative method of delivering electrons to 
nitrogenase without a requirement for ATP 
hydrolysis, and they also offer new avenues for 
addressing aspects of the nitrogenase mecha- 
nism. Electron transfer from an electrode, pre- 
viously demonstrated for a number of redox 
enzymes, has been applied to nitrogenase, but 
only via mediators (no direct electron transfer) 
(27-29). As illustrated in a recent study, this 
approach, in combination with computations 
(30), provides insights into catalytic nitrogen 
transformations. 
Fig. 2. Linear scaling between nitrogen-binding
energy and activation barrier for N2
dissociation. TOF, turnover frequency; Ea,
activation energy. [Reproduced with
permission from (22)]


Fig. 3. FeMo cofactor. Fe, rust-colored;
S, yellow; C, gray; Mo, magenta; O, red. The
depiction is based on metrical data from PDB
(Protein Data Bank) ID 1M1N.


A related frontier area is using nanomaterials
to deliver electrons to nitrogenase. For example,
it has been recently demonstrated that a CdS
nanorod-nitrogenase MoFe protein hybrid achieves
light-dependent N2 reduction to NH3 by using
a sacrificial electron donor (31). Such systems offer
a means to use light as the exclusive energy
source to drive N2 reduction.


Homogeneous catalysis 


Synthetic homogeneous catalysts designed for
N2 reduction provide well-defined molecular
precatalysts and intermediates, which can be
thoroughly characterized by diverse spectroscopic
techniques and thereby provide excellent opportunities
for determining mechanistic information
about N2 reduction to NH3. The reactivity of synthetic
inorganic complexes offers insights into
molecular reactivity and individual bond-making
processes that occur in nitrogenases. The discovery
of nitrogenase, along with a transition metal
compound that contains N2 bound to ruthenium
(32), initiated a race to prepare NH3 catalytically
under mild conditions using a transition metal
complex. Entire institutes were created in England
and Russia in the late 1960s for this purpose. The
stoichiometric N2-to-NH3 conversion using H2SO4
was demonstrated with low-valent Mo-N2 and
W-N2 complexes in the 1970s (33); typically,
the electrons required came from the metal
complex. In 1985, Pickett and Talarmin accomplished
the synthesis of NH3 by using an electrochemical
route starting from the tungsten
complex [W(N2)2(PMe2Ph)4; Me, methyl; Ph,
phenyl], with the electrons supplied by the electrode
and protons from an added acid (34). The
catalytic reduction of N2 to NH3 by H+ and e−
was reported in subsequent studies, using, for
example, a well-defined Mo-containing catalyst
(35). Up to eight equivalents of NH3 per Mo
were formed, along with H2. Experiments and
calculations both support a mechanism involving
the addition of 6H+ and 6e− to the N2 initially
bound to a Mo3+ center. Altogether, eight of
these proposed intermediates were prepared and
characterized.

Additional Mo precatalysts for reduction of
N2 using chemical reductants and an acid, namely a
dimolybdenum-dinitrogen complex (36) and a
molybdenum nitride (37), were subsequently
reported in 2011 and 2015, respectively. In 2013,
Fe complexes were reported that catalyze reduction
of N2 to NH3 using a strong reducing agent
(KC8) and a strong acid at 200 K (38); improvements
to the Fe system recently produced as many
as 84 equivalents of NH3 per Fe (39). These Fe
complexes were designed to reflect the trigonal
symmetry of the Fe in the catalytic face of the
FeMo cofactor (Fig. 3) and to allow variation of
an axial ligand, so as to explore tuning of N2
binding and reduction by the unusual carbide at
the center of the FeMo cofactor.

In all these cases, it appears that catalytic N2
reduction is a six-electron process, involving many
intermediate chemical species (M-NxHy^n+), but,
unlike nitrogenase, without a mechanistic requirement
for production of H2. Nonetheless, in all
cases, N2 reduction is accompanied by production
of H2, and at least with the Fe complexes,
this has been shown to involve a catalytic process,
with H+ reduction competing with N2 reduction.

Much work remains to move this field forward.
For example, rational design of potential catalysts
will be guided by accurate determination of the
bond dissociation free energies (BDFEs) of N-H
bonds, including likely intermediates such as
M-N=NH, M=N-NH2, and M=N-NH3. Initial experimental
and theoretical estimates have shown
that N-H BDFEs of coordinated NH3 can vary
considerably, and they demonstrate the importance
of metal identity, metal oxidation state,
and the attached ligands, which together tune
the stability of potential intermediates. In addition,
determination of the energies of these bonds
will allow evaluation of energetic requirements
of the individual chemical steps that ultimately
lead to NH3. It is also important to identify features
that tune the reduction potentials of relevant
complexes to ensure that the multiple
reductions required for catalysis are energetically
feasible. Thus, research should focus on
exploring complexes designed to simultaneously
promote N2 binding and multiple reductions at
the metal site.


Electrocatalysis and photocatalysis 


The potential-pH predominance (Pourbaix) diagram
for the N2-H2O system shown in Fig. 4 reveals
that reduction of N2 to NH4+ or NH3 (Eq. 2
and the N2 reduction line in Fig. 4) is thermodynamically
possible under moderately reducing
conditions.


$$\mathrm{N_2(g) + 8\,H^+(aq)\ [or\ 6\,H^+(aq)] + 6\,e^- \rightleftharpoons 2\,NH_4^+(aq)\ [or\ 2\,NH_3(aq)]} \qquad (2)$$


Moreover, with a sufficiently active electrocatalyst,
the N2 reduction reaction can, in principle,
proceed in a narrow range of negative
potentials without interference from water reduction
(line a in Fig. 4) over the entire pH
range. However, at potentials below line a, an
extremely selective electrocatalyst would be required
to suppress parasitic water reduction. The
nitrogen reduction reaction could be paired with
a water oxidation half-reaction to supply the H+
and e− required for a complete electrolysis cell.
Such a cell would essentially combine water splitting
with NH3 formation as depicted by the direct
reductive fixation route in Fig. 1. Despite the
inviting prospect of a scalable NH3 electrolysis
cell running on renewable electricity at or near
room temperature, the difficulty of selectively
catalyzing the nitrogen reduction reaction has
prevented development of an efficient electrochemical
system.
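
The width of the potential window between the N2 reduction line and line a can be estimated from the Nernst equation. The sketch below is illustrative: the two standard potentials are commonly tabulated textbook values assumed here (they are not given in the article), all activities are set to unity, and the temperature is taken as 298 K.

```python
NERNST_SLOPE = 0.05916  # V per pH unit at 298 K
E0_N2_TO_NH4 = 0.275    # V vs SHE, N2 + 8H+ + 6e- = 2NH4+ (assumed textbook value)
E0_N2_TO_NH3 = 0.092    # V vs SHE, N2 + 6H+ + 6e- = 2NH3(aq) (assumed textbook value)

def n2_reduction_potential(ph):
    """Equilibrium potential of the N2 reduction line at unit activities."""
    e_ammonium = E0_N2_TO_NH4 - (8 / 6) * NERNST_SLOPE * ph
    e_ammonia = E0_N2_TO_NH3 - NERNST_SLOPE * ph
    # The predominant product is the one formed at the higher potential.
    return max(e_ammonium, e_ammonia)

def water_reduction_potential(ph):
    """Line a: 2H+ + 2e- = H2 at 1 bar."""
    return -NERNST_SLOPE * ph

def selective_window(ph):
    """Potential range in which N2 reduction is allowed but H2 evolution is not."""
    return n2_reduction_potential(ph) - water_reduction_potential(ph)

for ph in (0, 7, 14):
    print(f"pH {ph:2d}: N2 line at {n2_reduction_potential(ph):+.3f} V, "
          f"line a at {water_reduction_potential(ph):+.3f} V, "
          f"window {selective_window(ph):.3f} V")
```

With these assumed potentials the window is positive at every pH but never wider than a few tenths of a volt, which illustrates why catalyst selectivity, rather than thermodynamics, is the limiting issue.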

At present, only a few examples of pairing the
nitrogen reduction reaction with water oxidation
at ambient temperature and pressure have been
reported, and in all cases, owing to competing H2
evolution, the faradaic efficiencies for NH3 are so
low that even its reliable detection is challenging
(40, 41). For example, an efficiency of 0.83% and
a production rate of 9.4 × 10^-10 mol NH3 per
square centimeter per second were measured
at 353 K at an applied cell voltage of 1.2 V using Pt
electrodes and a Nafion electrolyte (42). Another
electrochemical approach involved an electrochemical
cycling process for producing NH3 (43).
The cycling included separate steps of LiOH
electrolysis, Li nitridation, and Li3N hydrolysis,

Fig. 4. Partial Pourbaix diagram for the
N2-H2O system. Solid lines correspond to
N2 reduction to NH4+ or NH3 and N2
oxidation to NO3−. Dotted lines a and b
straddle the region of water stability
(reduction to H2 and oxidation to O2,
respectively). Details are given in (14), and
primary data are in (15, 19). Vertical axis:
potential, V vs. SHE.


Chen et al., Science 360, eaar6611 (2018) 


thereby circumventing the parasitic H2 evolution
reaction and leading to higher faradaic efficiencies
for NH3.
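
Faradaic efficiency relates the NH3 production rate to the current passed: each NH3 requires three electrons (Eq. 2), so the NH3 partial current density is 3F times the areal rate. The sketch below applies this to the Pt/Nafion example, reading the quoted rate as 9.4 × 10^-10 mol per square centimeter per second. The total current density it returns is inferred from the two quoted numbers and is not a value reported in the text.

```python
FARADAY = 96485.33212  # C/mol
ELECTRONS_PER_NH3 = 3  # 6 e- per 2 NH3 in Eq. 2

def nh3_partial_current_density(rate_mol_cm2_s):
    """Current density (A/cm^2) that ends up in NH3 at the given areal rate."""
    return ELECTRONS_PER_NH3 * FARADAY * rate_mol_cm2_s

def faradaic_efficiency(rate_mol_cm2_s, total_current_density):
    """Fraction of the total current that produces NH3."""
    return nh3_partial_current_density(rate_mol_cm2_s) / total_current_density

def implied_total_current_density(rate_mol_cm2_s, efficiency):
    """Total current density consistent with a given rate and efficiency."""
    return nh3_partial_current_density(rate_mol_cm2_s) / efficiency

rate = 9.4e-10        # mol NH3 per cm^2 per s, as read from the text
efficiency = 0.0083   # 0.83%, as quoted in the text

j_nh3 = nh3_partial_current_density(rate)
j_total = implied_total_current_density(rate, efficiency)
print(f"NH3 partial current density: {j_nh3 * 1e3:.2f} mA/cm^2")
print(f"Implied total current density: {j_total * 1e3:.1f} mA/cm^2")
```

On this reading, roughly a quarter of a milliampere per square centimeter goes to NH3 while tens of milliamperes go to H2 evolution, which conveys how strongly the competing reaction dominates.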

Proton-conducting ceramic oxides have been 
explored as electrodes for nitrogen reduction, 
and when they operate at higher temperatures, 
both the rate and the faradaic efficiency in- 
crease (44). For example, efficiencies in excess 
of 50% and rates in the range of 10^-9 to 10^-8 mol
NH3 per square centimeter per second have been
attained above 673 K by using oxide or molten 
chloride electrolytes and a variety of metal or 
metal oxide cathodes (44, 45). However, higher 
temperatures introduce an additional energy bur- 
den and hence compromise the advantage of this 
approach compared with the H-B process. 

Photochemical reduction of N2 to NH3 has also
been reported, and these studies may contain
guidance relevant to electrochemistry (46, 47).
Photocatalytic routes to NH3 have been summarized
recently (48). One of the biggest challenges
for photocatalytic NH3 synthesis is the discovery
of active and stable photoelectrode materials that
are responsive to visible light. Synthetic nitrogenase
mimics in the form of chalcogels, composed
of Mo- and Fe-containing biomimetic clusters, can
potentially accomplish photocatalytic N2 fixation
and conversion to NH3 at ambient temperature
and pressure. Both Fe-S and Mo-Fe-S chalcogels
have displayed promising activity toward N2 reduction
(49).


Plasma-driven transformations 


The feasibility of using plasmas consisting of 
ions, electrons, and excited molecular species to 
drive chemical reactions that are thermally inac- 
cessible has been known for many years (7). For 
example, NH3 was synthesized in laboratory-scale,
nonthermal plasmas of N2 and H2, at temperatures
and pressures lower than those used in
the H-B process as early as the 1970s, and more 
recently, the synthesis has been demonstrated 
under much milder conditions (1 atm and 140°C) 
(50). The plasma synthesis of NH3 has been carried
out both homogeneously and in combination with 
heterogeneous catalysts (7, 50, 51). 

Several mechanisms have been proposed to 
account for enhancements in catalytic activity 
in the presence of nonthermal plasmas (52-54). 
However, the fundamental science and practical 
engineering of plasma-driven NH3 syntheses lag
behind thermal and electrochemical routes and 
are therefore ripe for additional research. Further 
progress in this field will draw on the insights 
gained from traditional heterogeneous and electro- 
chemical reductions. 


Chemical looping 


Chemical looping is perhaps most familiar from 
combustion, where two connected fluidized beds 
are used to cycle solid particles between an oxi- 
dized and a reduced state to combust a carbon- 
containing fuel with oxygen (55). Chemical looping 
can also be performed with two fixed beds instead 
of fluidized beds. Nitrogen can be used instead 
of oxygen for a similar looping approach in NH3 
synthesis (56). First, N2 is contacted with a suitable
solid-state transition metal to yield a nitride
(activation). Second, NH3 is harvested by contacting
the nitride with steam or H2 (57). Recently,
lithium has been used as the parent metal for the 
nitride, but this approach requires liquid-phase 
electrolysis to recycle lithium, and this step is 
usually not present in chemical looping (43). 

Potential advantages of chemical looping in- 
clude the ability to independently control process 
conditions for N2 activation and product harvest.
In this sense, chemical looping can break the 
scaling relationship alluded to in Fig. 2. Operation 
at atmospheric pressure is another major advan- 
tage of chemical looping for nitrogen activation 
(58). On the basis of results from a pilot-scale 
plant, chemical looping for combustion, including
CO2 capture, is comparable to conventional
combustion and appears to be economically
attractive (59). This may be an indication of the 
possible economic feasibility of chemical loop- 
ing for nitrogen activation. 


Oxidation of N2
Overview 


The dominance of the H-B process in commercial
N2 fixation is arguably a direct consequence
of the ready availability of H2 from fossil hydrocarbons.
As shown in Fig. 1, oxidative fixation of
N2 can, in principle, be achieved with a lower
energy input than reductive fixation if N2, H2O,
and O2 are used as reactants. Other nitrogen
oxides are accessible at even lower free energies,
especially when coupled with water to form the
corresponding oxoacids.

Although H-B and natural processes have
motivated substantial research into reductive N2
fixation, direct N2 oxidation remains largely unexplored
despite its great practical value as a replacement
for the Ostwald process. The fact that
living organisms have not evolved to consume
N2 and O2 and produce aqueous HNO3 suggests
fundamental chemical challenges that merit investigation.
As a result, there is very little literature
on the homogeneous oxidative chemistry of N2.


Direct oxidation of N2 with O2


NOx is unavoidably generated at the high temperatures
that prevail during combustion in air.
NO is the primary combustion-generated component
of NOx, and its formation can be viewed
in the context of its equilibrium with N2 and O2
(Eq. 3).


$$\mathrm{N_2(g) + O_2(g) \rightleftharpoons 2\,NO(g)} \qquad (3)$$


The standard enthalpy of this reaction is
90.3 kJ/mol NO, and the entropy is 12.4 J/K
per mol NO (14, 15), indicating that equilibrium
is unfavorable for NO production under ambient
conditions. Owing to the positive entropy, however,
the equilibrium NO concentration rises
rapidly with temperature, reaching 0.8% in air at
2000 K (60).
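
The 0.8% figure follows from the quoted enthalpy and entropy. The sketch below is a rough check under stated assumptions: the enthalpy and entropy are treated as independent of temperature, air is taken as 0.78 bar N2 and 0.21 bar O2 at 1 bar total (values assumed here, not given in the text), and the small depletion of N2 and O2 by the reaction is neglected.

```python
import math

R = 8.314462618          # gas constant, J/(mol K)
DELTA_H_PER_NO = 90.3e3  # J per mol NO, as quoted in the text
DELTA_S_PER_NO = 12.4    # J/(K mol NO), as quoted in the text

# Assumed composition of air at 1 bar (not from the article).
P_N2, P_O2 = 0.78, 0.21

def equilibrium_constant(temperature_k):
    """K for N2 + O2 = 2NO (Eq. 3), assuming temperature-independent dH and dS."""
    delta_g = 2 * (DELTA_H_PER_NO - temperature_k * DELTA_S_PER_NO)
    return math.exp(-delta_g / (R * temperature_k))

def equilibrium_no_fraction(temperature_k, p_n2=P_N2, p_o2=P_O2):
    """NO mole fraction at 1 bar, neglecting depletion of N2 and O2."""
    return math.sqrt(equilibrium_constant(temperature_k) * p_n2 * p_o2)

for temperature in (298.15, 1000.0, 2000.0):
    print(f"T = {temperature:6.1f} K: x_NO = {equilibrium_no_fraction(temperature):.2e}")
```

With these inputs the NO mole fraction at 2000 K comes out close to 0.8%, while at room temperature it is vanishingly small, in line with the statement that equilibrium is unfavorable under ambient conditions.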

The direct bimolecular reaction of N2 and O2
to produce NO is symmetry-forbidden and has a
negligible rate even at combustion temperatures.
Rather, NO is produced during combustion via
Eqs. 4 and 5, where O (and similarly OH) radicals
attack N2, as originally proposed by Zeldovich (61).


$$\mathrm{O + N_2 \rightarrow NO + N} \qquad (4)$$
$$\mathrm{N + O_2 \rightarrow NO + O} \qquad (5)$$


Analogous reactions are at play in high-temperature thermal plasmas, as in the original B-E process. Thermal plasmas are unlikely to be energy-competitive with the H-B process, even if the latter uses a renewable H2 source (62), because the overall energy efficiencies of N2 activation are low, and rapid thermal quenching is required to suppress NO decomposition back to N2 and O2.

The aforementioned fundamental limitations can be mitigated using low-temperature, nonthermal plasmas generated by either a dielectric barrier discharge or microwave absorption at or below atmospheric pressure, potentially coupled with appropriate catalysts (7). According to current estimates, theoretical energy consumption for N2 fixation via Eq. 3 in a nonthermal plasma (~400 kJ/mol N2) is lower by more than a factor of 2.5 than that for the H-B process with methane-derived H2 (62-64), and the energy consumption already attained in the laboratory [600 to 1200 kJ/mol N2, assuming 100%-efficient plasma generation (63, 64)] is lower than that of the H-B process using H2 from water electrolysis [~3000 kJ/mol N2 (62)], 40% of which would go to electrolysis (65). Experiments and kinetic models show that the primary effect of the nonthermal plasma is to accelerate Zeldovich-like reactions involving nonthermal, vibrationally excited N2. Practical implementation of nonthermal plasmas is hampered by their relatively low power per unit reactant mass and thus low throughput.

A thermal, heterogeneous N2 oxidation catalyst would have to activate N2 and O2 and be stable at the extreme temperatures at which NO production becomes thermodynamically favored. Because of these stringent demands, nonthermal electro-, photo-, or plasma-assisted oxidation routes likely offer the greatest opportunities for progress. The development of catalytic systems that can effectively couple with these external energy sources will be key to practical advances. Preliminary results with nonthermal plasmas, especially coupled to catalysts, are promising. There is a pressing need to uncover the fundamental mechanisms that unite plasma chemistry, surface chemistry, catalysts, and theory to guide optimization of plasma and rational development of appropriate catalytic materials.


Electrochemical oxidation of N2 to HNO3


The oxidation line in Fig. 4 shows that N2 is electrochemically unstable toward NO3⁻ under moderately oxidizing conditions (Eq. 6).


$$\mathrm{N_2(g)} + 6\,\mathrm{H_2O(liq)} \rightleftharpoons 2\,\mathrm{NO_3^-(aq)} + 12\,\mathrm{H^+(aq)} + 10\,e^- \qquad (6)$$


At pH > 1, this 10-electron reaction is more thermodynamically favorable than the parasitic four-electron water oxidation to O2. Thus, it is possible for NO3⁻ to be the only product of an anodic process, particularly in neutral and alkaline solutions, if a sufficiently active and selective electrocatalyst for Eq. 6 can be discovered. The corresponding cell-completing cathodic reaction can be either water reduction to H2 or O2 reduction to water. Little apparent progress has yet been made in developing such catalysts, but there is no reason to believe that they cannot eventually be discovered.

By coupling cathodic N2 reduction (Eq. 2) with anodic N2 oxidation (Eq. 6), an electrochemical cell consuming only atmospheric N2 and water as feedstock to yield aqueous NH4NO3 (Eq. 7) can be envisioned.


$$\mathrm{N_2(g)} + 3\,\mathrm{H_2O(liq)} \rightleftharpoons \mathrm{NH_4^+(aq)} + \mathrm{NO_3^-(aq)} + \mathrm{H_2(g)} \qquad (7)$$


The additional H2 production is necessary for matching cathodic and anodic currents and maintaining the electrolyte pH constant. Such a cell would require a minimum of 1.08 V to operate in neutral or alkaline electrolyte, which is slightly lower than the minimum water electrolysis voltage of 1.23 V. Advances relating to electrochemical N2 oxidation will require development of catalysts that are sufficiently active (capable of good current densities at low overpotentials), selective (with respect to water oxidation), and stable (toward deactivation under harsh redox and pH operating conditions).
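The current-matching argument for Eq. 7 can be checked with simple bookkeeping. The sketch below combines the anodic half-reaction (Eq. 6) with two cathodic half-reactions and confirms that atoms and charge balance. The cathodic forms used here (N2 + 8 H⁺ + 6 e⁻ → 2 NH4⁺ for Eq. 2, and 2 H⁺ + 2 e⁻ → H2) are assumptions, because Eq. 2 is not reproduced in this section, and all helper names are illustrative. The last function converts the quoted 1.08 V minimum into a free energy through ΔG = nFE, assuming five electrons pass per Eq. 7 as written.

```python
from collections import Counter
from fractions import Fraction

FARADAY = 96485.0  # C/mol

# Each species: (element counts, charge). The electron carries only charge.
SPECIES = {
    "N2":   ({"N": 2}, 0),
    "H2O":  ({"H": 2, "O": 1}, 0),
    "H2":   ({"H": 2}, 0),
    "H+":   ({"H": 1}, +1),
    "NH4+": ({"N": 1, "H": 4}, +1),
    "NO3-": ({"N": 1, "O": 3}, -1),
    "e-":   ({}, -1),
}

# Reactions as {species: coefficient}; reactants negative, products positive.
ANODE = {"N2": -1, "H2O": -6, "NO3-": 2, "H+": 12, "e-": 10}  # Eq. 6
N2_REDUCTION = {"N2": -1, "H+": -8, "e-": -6, "NH4+": 2}      # assumed form of Eq. 2
H2_EVOLUTION = {"H+": -2, "e-": -2, "H2": 1}                  # assumed H2 evolution
EQ7 = {"N2": -1, "H2O": -3, "NH4+": 1, "NO3-": 1, "H2": 1}


def combine(*weighted):
    """Sum (reaction, weight) pairs and drop species that cancel."""
    total = Counter()
    for reaction, weight in weighted:
        for species, coeff in reaction.items():
            total[species] += Fraction(coeff) * weight
    return {sp: c for sp, c in total.items() if c != 0}


def is_balanced(reaction):
    """True if every element and the net charge sum to zero."""
    atoms, charge = Counter(), 0
    for species, coeff in reaction.items():
        elements, q = SPECIES[species]
        charge += coeff * q
        for element, n in elements.items():
            atoms[element] += coeff * n
    return charge == 0 and all(v == 0 for v in atoms.values())


HALF = Fraction(1, 2)
OVERALL = combine((ANODE, HALF), (N2_REDUCTION, HALF), (H2_EVOLUTION, Fraction(1)))
ELECTRONS = ANODE["e-"] * HALF  # electrons passed per Eq. 7 as written


def free_energy_kj(cell_voltage, electrons=ELECTRONS):
    """Free energy (kJ per mol of Eq. 7) implied by a minimum cell voltage."""
    return float(electrons) * FARADAY * cell_voltage / 1000.0
```

With these weights the anode supplies five electrons per Eq. 7, three of which go to N2 reduction and two to H2 evolution, and the protons cancel, which is why H2 coproduction keeps the pH constant.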


NOx reduction and NH3 oxidation
Overview


The redox chemistry of forms of nitrogen other than N2 is immense and rich, and it is extensively incorporated into biological systems. Both NOx and NH3 are environmentally hazardous pollutants generated by industrial and transportation activity, and thus much research has gone into developing and applying NOx reduction and NH3 oxidation catalysts. As is often the case, the understanding of these catalysts lags their applications.

Ammonia oxidation and NOx reduction are critical to environmental protection and are much less well understood or optimized than the synthesis of NH3. There are appealing opportunities to apply the experimental and computational tools of modern heterogeneous catalysis to these reactions, both to develop a fundamental understanding of the processes and to identify superior materials and catalytic transformations (for example, catalysts that can selectively reduce NOx to N2 using hydrocarbon reductants). Environmental applications generally demand high longevity, durability, and tolerance to poisons. Key gaps in understanding include how activity and selectivity can be achieved in relatively cool gas streams, how catalyst structure evolves over long periods of time, how sulfur and other common poisons interfere with reactions, and especially how these effects can be mitigated.


NOx reduction


NO decomposition, the reverse of Eq. 3, is ther- 
modynamically downhill but difficult to catalyze 
under the oxygen-rich conditions of interest for 
environmental protection (66). Nevertheless, NO 
decomposition has been demonstrated using both 
homogeneous and heterogeneous catalysts. De- 
composition appears to involve a two-electron 
reduction of two NO molecules to a hyponitrite 
(Eq. 8). 


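$$2\,\mathrm{NO} + e^- \rightarrow \mathrm{N_2O_2^-} \qquad (8\mathrm{A})$$
$$\mathrm{N_2O_2^-} + e^- \rightarrow \mathrm{N_2O_2^{2-}} \qquad (8\mathrm{B})$$
$$\mathrm{N_2O_2^{2-}} + 2\,\mathrm{H^+} \rightarrow \mathrm{N_2O} + \mathrm{H_2O} \qquad (8\mathrm{C})$$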


Coordination complexes of hyponitrites are not uncommon, and a tri-Cu hyponitrite coordination complex has been isolated as an intermediate in the catalytic decomposition of NO to N2 (67). A Cu dimer has similarly been implicated as the active site in a Cu-exchanged zeolite catalyst for NO decomposition (68).

NOx can be catalytically reduced to N2 via several routes. Precious metals are active for NOx reduction by CO and hydrocarbons, but only under conditions in which O2 concentrations are low (10). Vanadia/titania catalysts and metal-exchanged zeolites catalyze selective reduction of NOx by NH3 (69). Nitrosamine appears to be the key N2-forming intermediate in the catalytic pathway (Eq. 9).


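$$\mathrm{NH_3} + \mathrm{NO} \rightarrow \mathrm{H_2NNO} + \mathrm{H^+} + e^- \qquad (9\mathrm{A})$$
$$\mathrm{H_2NNO} \rightarrow \mathrm{N_2} + \mathrm{H_2O} \qquad (9\mathrm{B})$$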


This N2-forming reaction is exothermic, implying substantial, but perhaps not insurmountable, kinetic challenges to reversing the chemistry.

NOx can also be catalytically reduced with H2 over Pd and Pt catalysts (13), and N2 is typically the desired product (66). By suitable selection of promoters and control of the NO/H2 ratio, however, NH3 can be produced over Pt catalysts (70). Likewise, NH3 competes with N2 as the product of the catalytic reduction of NO2⁻ and NO3⁻ by H2 over Pd catalysts (13).

Certain bacteria analogously reduce NO3⁻ and NO2⁻ through a series of gaseous intermediates, leading ultimately to N2 in a process called denitrification (Eq. 10). In these bacteria, NOx⁻ acts as a terminal electron acceptor in an anaerobic respiration process. Electrons flowing through membrane-bound proteins produce a proton gradient that in turn is used to make ATP. Each step in the denitrification process is catalyzed by well-studied metalloenzymes that use unusual metal clusters to achieve their chemistry (71, 72). From a global nitrogen perspective, denitrification by these bacteria functions to convert fixed forms of N (NOx⁻) into N2. In some cases, this process is beneficial, such as during wastewater treatment to remove nitrates. In other cases, such as in agriculture, denitrification results in loss of N that was applied to crops as fertilizer.


$$2\,\mathrm{NO_3^-} + 10\,e^- + 12\,\mathrm{H^+} \rightarrow \mathrm{N_2} + 6\,\mathrm{H_2O} \qquad (10)$$


NH3 oxidation 


Heterogeneous NH3 oxidation with O2 is at the heart of the Ostwald process and is practiced in a variety of environmental protection applications. Precious metals remain the workhorse catalysts, and therefore opportunities exist to develop lower-cost materials having equivalent or better activity and selectivity. The preferred NH3 oxidation products shift from N2 at low temperatures to NO at higher temperatures, with N2O being a minor but highly undesirable product. At present, a full mechanistic understanding of this process and of the kinetic origins of selectivity is lacking.

NH3 and/or NO2⁻ are fuel for bacteria and archaea that use multi-electron oxidative reactions to generate reducing equivalents for cellular respiration and to establish a proton gradient for ATP synthesis (73, 74). Ammonia-oxidizing bacteria (AOB) and archaea (AOA) stoichiometrically convert NH3 to NO2⁻. AOA and AOB first carry out the selective conversion of NH3 to hydroxylamine (NH2OH) by using an integral membrane Cu monooxygenase called ammonia monooxygenase (AMO) (Eq. 11). Active AMO has never been purified, and thus this challenging transformation awaits mechanistic elucidation.


$$\mathrm{NH_3} + \tfrac{1}{2}\,\mathrm{O_2} \rightarrow \mathrm{NH_2OH} \qquad (11)$$


The NH2OH generated by AMO is oxidized to provide a net electron flow for cellular respiration. How AOA processes NH2OH is unknown (75). In AOB, this process is carried out by hydroxylamine oxidoreductase (HAO), which contains a rare, covalently modified c-heme, heme P460, to which NH2OH binds at Fe and undergoes proton-coupled oxidation (73). Recent experiments strongly support NO as the enzymatic product of HAO reactivity, contrary to decades of dogma asserting NO2⁻ as the enzymatic product (76). Assuming a two-electron turnover of AMO, the three-electron oxidation of NH2OH by HAO furnishes one net electron for cellular respiration. Stoichiometric production of NO2⁻ from NH3 by AOB likely involves an additional, but as yet unidentified, third enzyme in the pathway. Cytotoxicity of both NO and NH2OH may be managed by cytochrome P460, a monoheme enzyme that selectively produces N2O from the reaction of NH2OH with Fe-bound NO (77). NO2⁻ is itself cellular fuel for nitrite-oxidizing bacteria that effect the two-electron oxidation of NO2⁻ to NO3⁻ and for the recently discovered “complete ammonia oxidation” (or comammox) bacteria that effect the eight-electron oxidation of NH3 to NO3⁻ (78).
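The electron counts quoted in this paragraph follow from the formal oxidation state of nitrogen in each species. The short sketch below makes that bookkeeping explicit. It is illustrative only: the oxidation states use the usual convention (H = +1, O = -2), the variable names are invented here, and the two-electron cost of AMO turnover is the assumption stated in the text rather than a derived quantity.

```python
# Formal oxidation state of N in each species (H = +1, O = -2).
N_OXIDATION_STATE = {
    "NH3": -3,
    "NH2OH": -1,
    "NO": +2,
    "NO2-": +3,
    "NO3-": +5,
}


def electrons_released(reactant, product):
    """Electrons removed per N atom when `reactant` is oxidized to `product`."""
    return N_OXIDATION_STATE[product] - N_OXIDATION_STATE[reactant]


# HAO: NH2OH -> NO is a three-electron oxidation.
hao_electrons = electrons_released("NH2OH", "NO")

# AMO turnover is assumed (as in the text) to consume two electrons.
amo_electrons_consumed = 2

# Net electrons left over for cellular respiration per NH3 processed.
net_for_respiration = hao_electrons - amo_electrons_consumed

# Nitrite oxidizers: NO2- -> NO3-.
nitrite_oxidation = electrons_released("NO2-", "NO3-")

# Comammox: NH3 -> NO3-.
comammox = electrons_released("NH3", "NO3-")

if __name__ == "__main__":
    print(hao_electrons, net_for_respiration, nitrite_oxidation, comammox)
```

The same table shows why an additional oxidation step is needed in AOB: NO sits one electron short of NO2⁻.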

Opportunity abounds to extract broadly ap- 
plicable insights through the study of the oxidative 
enzymes operative in nitrification. Establishing 
the mechanism of NH3 oxidation by AMO remains
a grand challenge, whose difficulty is exacerbated 
by the integral membrane nature of the enzyme 
and the sluggish growth of its host organisms. 
The end goal is tantalizing technology: selective 
hydroxylation of relatively inert bonds that is 
generalizable beyond N species—especially con- 
sidering that enzymes related to AMO, such as 
particulate methane monooxygenase, effect sim- 
ilar transformations of unactivated alkanes (79). 

Mechanistic understanding of NH2OH and NO2⁻ oxidation offers insights into proton-coupled, multi-electron transformations that are crucial to redox catalysis. NH2OH is an energetic, cytotoxic metabolite that AOB and AOA harness as a source of reducing equivalents. The absence of requisite c-heme biosynthesis pathways in AOA necessitates replacement of HAO by either a nonheme or a Cu-based enzyme for NH2OH oxidation (80).
Thus, opportunities exist to fill in a missing link 
in the biogeochemical nitrogen cycle while ex- 
panding understanding of how transition metals 
can transduce energy from redox-active small 
molecules. 


Concluding outlook 


Since the beginning of the 20th century, both 
the ability to produce and the need to remediate 
nitrogen-containing compounds have been coupled 
to fossil fuels. Demands for greater energy ef- 
ficiency, smaller and more flexible distributed 
processes, and environmental protection provide 
growing impetus to expand the scope of practi- 
cally viable oxidative and reductive transforma- 
tions of nitrogen that are not driven by fossil fuels. 
Opportunities exist to identify new and radically 
improved pathways, but progress in this regard 
will require a molecular-level understanding of 
nitrogen transformation reactions, as well as the 
translation of these core insights to the discovery 
of new catalytic systems and alternative means 
of delivering the energy needed to drive those 
reactions. These advances will emerge through 
the collective knowledge and insights to be gained 
from fundamental research that integrates exper- 
iments and theory in hetero- and homogeneous 
catalysis, photon- and electron-driven processes, 
and biology.


INTRODUCTION: The complete sequence
of animal genomes has had a transformative 
impact on biological research. Whereas the 
genome sequence of an organism contains 
the information for its development and 
physiology, the transcriptomes (the sets of 
actively transcribed genes) of the cell types 
in an organism define how the genome is 
used for the unique functions of its cells. Cell 
number and complexity have historically 
made the identification of all cell types, much 
less their transcriptomes, an extreme chal- 
lenge for most multicellular organisms. Recent 
advances in single-cell RNA sequencing (SCS) 
have greatly enhanced the ability to determine 
cell type transcriptomes, with SCS of thou- 
sands of cells readily achievable. 




RATIONALE: We reasoned that it might be possible, given these advances, to determine the transcriptomes of essentially every cell type of a complete organism possessing an unknown number of cell types. The planarian Schmidtea mediterranea, famous for its regeneration ability, is an attractive case study for such an undertaking. Planarians possess a complex anatomy with diverse differentiated cell types, including many found across animals. Furthermore, planarians contain a proliferating cell population called neoblasts that includes pluripotent stem cells. Neoblasts mediate regeneration and constitutive tissue turnover. Consequently, lineage precursors for essentially all differentiated cell types are also present in adults. Finally, planarians constitutively express positional information guiding tissue turnover. Therefore, comprehensive SCS at a single time point (the adult) could allow transcriptome determination for all differentiated cell types and for lineage precursors, and could identify patterning information that guides new cell production and organization. Capturing this information in most organisms would require sampling adults and many transient embryonic stages.

An atlas of planarian cell type transcriptomes. High-throughput single-cell RNA sequencing of adult planarians reveals a cell type transcriptome atlas that includes rare cell types and many novel cell populations, cellular transition states, and patterning information, as demonstrated by two regionally expressed genes in muscle.


RESULTS: We used the SCS method Drop-seq to determine the transcriptomes for 66,783 individual cells from adult planarians. We locally saturated cell type coverage by iteratively sequencing [...] in the data. Clustering the cells by shared gene expression grouped cells into broad tissue classes. Subclustering of each broad tissue type in isolation enabled separation of cells into the cell populations constituting each tissue. These analyses enabled the identification of a previously unidentified tissue group and the classification of poorly characterized tissues into their constituent cell types, including numerous previously unknown cell types. Transcriptomes were identified for many rare cell types, including those that exist as rarely as ~10 cells in an animal that has 10^5 to 10^6 cells, which suggests that near-to-complete cellular saturation was reached. In addition, transcriptomes for known and novel lineage precursors, from pluripotent stem cell to differentiated cell types, were generated. Precursor transcriptomes identified transcription factors required for maintenance of associated differentiated cells during homeostatic cell turnover. Finally, the data were used to identify genes regionally expressed in muscle, which is the site of planarian patterning gene expression.


CONCLUSION: We successfully used SCS to 
generate transcriptomes for most to all cells 
of a complete organism. This resource pro- 
vides a wealth of data regarding the cellular 
site of expression of thousands of conserved 
genes and the transcriptomes for cell types 
widely used in animals. These data will inform 
studies of these genes and cell types broadly, 
and will provide a resource for the fields of 
planarian biology and comparative evolution- 
ary biology. This work also provides a template 
for the generation of cell type transcriptome 
atlases, which can be applied to a large array of 
organisms. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: reddien@wi.mit.edu 

Cite this article as C. T. Fincher et al., Science 360, eaaq1736

Christopher T. Fincher, Omri Wurtzel, Thom de Hoog, Kellie M. Kravarik, Peter W. Reddien*


The transcriptome of a cell dictates its unique cell type biology. We used single-cell RNA sequencing to determine the transcriptomes for essentially every cell type of a complete animal: the regenerative planarian Schmidtea mediterranea. Planarians contain a diverse array of cell types, possess lineage progenitors for differentiated cells (including pluripotent stem cells), and constitutively express positional information, making them ideal for this undertaking. We generated data for 66,783 cells, defining transcriptomes for known and many previously unknown planarian cell types and for putative transition states between stem and differentiated cells. We also uncovered regionally expressed genes in muscle, which harbors positional information. Identifying the transcriptomes for potentially all cell types for many organisms should be readily attainable and represents a powerful approach to metazoan biology.


The complete sequence of animal genomes, such as that of Caenorhabditis elegans reported in 1998 and humans in 2001, has had an immeasurable impact on research (1-3). Whereas the genome sequence of an organism contains the information for its development and physiology, the transcriptomes (the sets of actively transcribed genes) of the cell types in an organism define how the genome is used for the unique functions of its cells. Recent advances in RNA sequencing of individual cells have greatly enhanced the ability to determine cell type transcriptomes (4, 5), and single-cell RNA sequencing (SCS) of thousands of cells has become readily achievable (6). For example, the transcriptomes of most cell types of complete C. elegans L2 larvae and numerous mouse cells were recently reported with this approach (7, 8). We reasoned that it might be possible, given these advances, to determine the transcriptomes of essentially every cell type of a complete adult organism possessing an unknown number of cell types.

Multicellular organisms can have many millions of cells and hundreds of different cell types, and the cellular composition of organisms varies markedly over the course of development. This complexity has historically made the identification of all cell types, much less their transcriptomes, an extreme challenge for most multicellular organisms. The planarian Schmidtea mediterranea is an attractive case study organism for which to generate the transcriptomes for all cells in an animal. Planarians are famous for their ability to regenerate essentially any missing body part, and they possess a complex body plan containing many characterized cell types (9, 10). Despite this complexity, with an average planarian possessing ~10^5 to 10^6 cells (11), planarians are smaller and anatomically simpler than humans and many other model systems such as mice. Planarians are also easily dissociated into single-cell suspensions, allowing potential characterization of all cells. Because some planarian cell types, such as glia (12, 13), have only recently been defined with molecular markers, it is probable that undescribed planarian cell types exist. The combination of known and potentially unknown cell types is attractive for developing approaches that can apply to diverse organisms with varying amounts of available cell type information. Planarians possess a population of proliferative cells called neoblasts that contain pluripotent stem cells, enabling their ability to regenerate and replace aged cells in tissue turnover (14). Neoblasts are the only cycling somatic cells and the source of all new tissue. Neoblasts contain multiple classes of specialized cells, with transcription factors expressed to specify cell fate (15, 16). Because of the constant turnover of planarian tissues, essentially all stages of all cell lineages, from pluripotent stem cell to differentiated cell, are anticipated to be present in the adult (9, 17).

Planarians also constitutively and regionally express dozens of genes that have roles in positional information (18). These genes, referred to as positional control genes (PCGs), are expressed in a complex spatial map spanning anterior-posterior (AP), medial-lateral (ML), and dorsal-ventral (DV) axes (18), and their expression is largely restricted to muscle (19). PCGs are hypothesized to constitute instructions for the maintenance and regeneration of the body plan. Because of these features, comprehensive SCS at a single time point (the adult) could allow transcriptome identification for all differentiated cell types, lineage precursors for these cells, and the patterning information that guides new cell production and organization. To capture this information in most organisms would require sampling the adult and many transient stages of embryogenesis.


Single-cell RNA sequencing of 50,562 
planarian cells 


Planarians have a complex internal anatomy 
including a brain, ventral nerve cords, peripheral 
nervous system, epidermis, intestine, muscle, an 
excretory system (the protonephridia), and a cen- 
trally located pharynx (10). These major tissues 
are composed of multiple different cell types that, 
together with other gland and accessory cells, 
constitute the planarian anatomy. 

To detect planarian cell types and states in an unbiased manner, including rare cell types, we used the SCS method Drop-seq (6) to determine the transcriptomes for 50,562 individual cells from adults (Fig. 1A, fig. S1A, and table S1). Planarians contain 10^5 to 10^6 cells (11), and yet some cell types are extremely rare, such as the ~100 photoreceptor neurons of eyes (20). Given such rarity, sequencing random cells from entire animals might not reach cell type saturation with even 10^5 cells sequenced. Therefore, we divided animals into five sections (head, prepharyngeal region, trunk with pharynx removed, tail, and the pharynx itself) and cells from each region were dissociated, sorted by flow cytometry, and sequenced (Fig. 1A, fig. S1A, and table S1). Sequences were aligned to a previously assembled transcriptome (21). We targeted cell type saturation by assessing coverage of known, rare cell types during iterative rounds of cell isolation and sequencing in a region-by-region approach. In total, 25 separate Drop-seq runs were completed, yielding cells with an average of 3020 unique molecular identifiers (UMIs) and 1404 genes (~13% of the estimated detection limit) (fig. S1, A to C, table S1, and supplementary materials).
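The sampling argument for dissecting animals into regions can be made concrete with a rough estimate. The sketch below assumes uniform random sampling of dissociated cells and uses a Poisson approximation; the function names and the choice of minimum hit count are hypothetical, and the cell numbers plugged in are only the round figures quoted in the text (about 100 photoreceptor neurons, or a cell type with about 10 members, in an animal of up to 10^6 cells).

```python
import math


def expected_hits(cell_type_count, cells_per_animal, cells_sequenced):
    """Expected number of sequenced cells of one type under uniform sampling."""
    return cells_sequenced * cell_type_count / cells_per_animal


def prob_at_least(k, cell_type_count, cells_per_animal, cells_sequenced):
    """P(at least k cells of the type are captured), Poisson approximation."""
    lam = expected_hits(cell_type_count, cells_per_animal, cells_sequenced)
    p_fewer = sum(math.exp(-lam) * lam**i / math.factorial(i) for i in range(k))
    return 1.0 - p_fewer


if __name__ == "__main__":
    # ~100 photoreceptor neurons in a 10^6-cell animal, 10^5 cells sequenced.
    print(expected_hits(100, 1e6, 1e5))
    # A cell type with ~10 members: chance of capturing even one cell.
    print(prob_at_least(1, 10, 1e6, 1e5))
```

On these assumptions, whole-animal sampling of 10^5 cells would be expected to capture only about ten photoreceptor neurons, and a cell type with roughly ten members would be missed entirely in a sizable fraction of experiments, which is consistent with the region-by-region strategy described above.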

Genes with high variance and expression across cells were used to generate informative principal components using Seurat (6, 22). Cells were grouped into 44 distinct major clusters with Seurat's graph-based clustering approach and were visualized by applying t-distributed stochastic neighbor embedding (t-SNE) to the transcriptomes (Fig. 1B and fig. S1D). Cells from different regions were largely interspersed in the t-SNE plots, except for cells from the pharynx, which contains many unique cell types (fig. S2A). Cell doublets were scarce within the data and did not affect clustering results (fig. S2, B to D). To determine the identity of each cluster, we identified cluster-specific genes by means of a receiver operating characteristic curve analysis and a likelihood ratio test based on zero-inflated data (table S2) (23). Expression of established cell type markers within each cluster and fluorescence in situ hybridization (FISH) with cluster-specific markers enabled cluster assignment to one of eight previously identified planarian tissue classes: protonephridia, neural, epidermis, intestine, pharynx, muscle, neoblast, and parenchymal (Fig. 1C). The parenchymal class was previously termed “parapharyngeal” because of localization of some enriched markers around the pharynx (24). However, most cell populations within this class exhibit broader localization in the planarian parenchyma. We also identified a ninth group of clusters marked by CTSL2 (dd175) expression (Fig. 1D). CTSL2 (dd175) FISH revealed cells with long processes distributed broadly. We designated this group of clusters the cathepsin⁺ class. Hierarchical clustering of a subset of 5000 cells by Euclidean distance, independently of Seurat, recapitulated assignment of cells into these nine tissue classes (fig. S3).

Clusters representing the major planarian tissue classes were generally heterogeneous in terms of gene expression. For example, neural clusters contained a large number of known neuronal cell types, which suggests that multiple distinct cell types could be identified within each major cluster (fig. S4). Therefore, we systematically subclustered each major cluster group (Figs. 2 to 6), identifying >150 subclusters, and determined genes with enriched expression in cells of each subcluster (table S2). Subclustering proved a powerful approach to defining the collection of cell types that constituted each major cluster and identified candidate transition states between stem cells and differentiated cells.


Progenitors in planarian cell lineages 


Neoblasts are abundant and express canonical marker genes such as smedwi-1 (25), vasa (26), and bruli (27) (Fig. 1C and fig. S5, A and B). Neoblasts are cycling cells and consequently show enrichment in expression of S/G2/M cell cycle markers (fig. S5C). To identify the transcriptomes of potential neoblast subpopulations, we selected in silico and subclustered 12,212 cells with smedwi-1 expression of ≥2.5 [ln(UMI-per-10,000 + 1)] (Fig. 2A and fig. S6A). Resulting clusters on the left of the plot were enriched in S/G2/M cell cycle markers (fig. S6B). These clusters included the previously characterized major specialized neoblast classes, including γ-neoblasts (intestine progenitors) and ζ-neoblasts (epidermis progenitors) (28) (Fig. 2B). A number of other subclusters were also identified, including one marked by expression of the contig dd_10988 (fig. S6, C and D). FISH confirmed that dd_10988 was expressed in a neoblast subset as well as in a number of smedwi-1⁻ cells (Fig. 2C).

Fig. 1. Drop-seq of 50,562 planarian cells. (A) Schematic illustrating the workflow used to isolate and cluster single cells. (B) t-SNE representation of 44 clusters generated from the data. (C and D) Upper panels: t-SNE plots colored according to gene expression (red, high; blue, low) for highly enriched genes from nine planarian tissue classes. Red outlines denote clusters assigned to that tissue class. Lower panels: FISH images for tissue-enriched genes. Scale bars: whole-animal images, 200 µm; insets, 50 µm.
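The in silico selection of neoblasts rests on the normalized expression unit ln(UMI-per-10,000 + 1) and a cutoff of 2.5 for smedwi-1. The sketch below shows one way that normalization and threshold could be applied to raw UMI counts. It is a minimal illustration in plain Python, not the Seurat workflow used in the study; the function names, the data layout, and the toy counts are all hypothetical.

```python
import math


def log_normalize(gene_umis, total_umis, scale=10_000):
    """Return ln(UMI-per-10,000 + 1) for one gene in one cell."""
    return math.log(gene_umis / total_umis * scale + 1.0)


def select_cells(cells, gene="smedwi-1", threshold=2.5):
    """Return IDs of cells whose normalized `gene` expression is >= threshold.

    `cells` maps a cell ID to a dict of raw UMI counts per gene.
    """
    selected = []
    for cell_id, counts in cells.items():
        total = sum(counts.values())
        if total == 0:
            continue  # skip empty barcodes
        if log_normalize(counts.get(gene, 0), total) >= threshold:
            selected.append(cell_id)
    return selected


# Hypothetical toy counts for illustration only (not data from the study).
toy_cells = {
    "cell_a": {"smedwi-1": 6, "other": 3014},
    "cell_b": {"smedwi-1": 1, "other": 3019},
    "cell_c": {"smedwi-1": 0, "other": 3020},
}

if __name__ == "__main__":
    print(select_cells(toy_cells))
```

For a cell with about 3020 UMIs (the average reported above), a cutoff of 2.5 corresponds to roughly 11 smedwi-1 UMIs per 10,000, that is, at least four smedwi-1 UMIs in that cell.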

The large number of subclustered neoblasts facilitated transcriptome determination for candidate progenitors for many planarian tissues. Clusters to the right of the plot were marked by a G1/G0 cell cycle status and displayed expression of various tissue markers (Fig. 2D and fig. S6B). These included a population defined by expression of POU2/3, a marker for protonephridia-specialized neoblasts (29), and a number of subclusters expressing markers also expressed in specific differentiated tissues or their postmitotic precursors, such as ChAT for the nervous system, prog-1 for the epidermis, ASCL4 (dd1854) for parenchymal cells, and COL4A6A (dd2337) for muscle (Fig. 2D and fig. S7A). Expression of these markers in smedwi-1⁺ cells suggests that these cells could be transition states for those lineages. Several markers enriched in the dd_10988⁺ subcluster, including dd_10988, were also expressed in cells of the two smedwi-1⁺ neural subclusters, as well as in neural cells of the initial clustering (figs. S6C and S7, B and C), which suggests that the dd_10988⁺ subcluster is enriched in neural progenitors. Likewise, many markers enriched in the PLOD1 (dd3457)⁺ subcluster were also expressed in the smedwi-1⁺ muscle subcluster, which suggests that the PLOD1 (dd3457)⁺ subcluster is enriched in muscle progenitors (fig. S7D). prox-1, hnf-4, nkx2.2, and gata4/5/6 encode transcription factors expressed in intestinal progenitors (28), and in these data all four genes were expressed in γ-neoblasts (Fig. 2, B and E, and fig. S7E), with hnf-4, nkx2.2, and gata4/5/6 also expressed in intestinal clusters (Fig. 2E and fig. S7F). hnf-4, but not prox-1, nkx2.2, and gata4/5/6, was also expressed in a smedwi-1⁺ cell cluster enriched in CTSL2 (dd175) expression (the cathepsin⁺ cell marker) and in differentiated cathepsin⁺ cells (Fig. 2E and fig. S7G). The additional transcription factor-encoding genes ETS1 (dd2092) and FOXF1 (dd6910) were expressed with hnf-4 in these cells and also displayed expression patterns similar to that of CTSL2 (dd175) in the animal (Fig. 2F and fig. S8, A and B) and have recently been shown to regulate the planarian pigment cell lineage (30). Pigment cells clustered within the cathepsin⁺ cell class in our data (see below). By FISH, hnf-4 was indeed coexpressed with nkx2.2 and gata4/5/6 in the intestine, but was also coexpressed with cathepsin⁺ cell markers (fig. S8, C to E), which suggests that hnf-4 is expressed in two distinct lineages. These data demonstrate the utility of this approach for identifying potentially novel neoblast progenitor populations and the transcription factors that define them.

Some planarian neoblasts display pluripotency
in clonal assays and are hypothesized to generate
all lineage-committed neoblast subpopulations;
these are called clonogenic neoblasts (4). We
selected cells that expressed high levels of smedwi-1
but were not ζ- or γ-neoblasts [the selection included
subclusters 2, 9, dd_10988+, dd_6998+, dd_17796+,
SAMD15 (dd19710)+, dd_11221+, dd_13666+, and
PLOD1 (dd3457)+] and subclustered this set of
neoblasts in isolation (fig. S9, A and B). A remnant
ζ-neoblast population (clusters 4 and 6), as
well as protonephridia progenitors (cluster 10)
and the putative neural (clusters 2 and 5) and
muscle (clusters 1 and 9) progenitor populations
described above in the smedwi-1+ cell subclustering,
were identified (fig. S9C and table S2). Clusters
0, 3, 7, and 8 were largely devoid of specifically 
enriched markers (table S2). It is therefore possi- 
ble that clonogenic neoblasts are defined by an 
absence of any tissue-specific markers, as opposed 
to the unique expression of specific genes. 
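
The selection step described above can be sketched in code. This is a minimal illustration rather than the authors' pipeline: it assumes the ln(UMI-per-10,000 + 1) normalization named in the Fig. 2 legend, borrows that legend's smedwi-1 cutoff of 2.5 as a stand-in for "high" expression, and uses hypothetical cell records, field names, and subcluster labels.

```python
import math


def log_normalize(umi_count, total_umis, scale=10_000):
    """Return ln(UMI-per-10,000 + 1) for one gene in one cell."""
    return math.log(umi_count / total_umis * scale + 1)


def select_candidate_neoblasts(cells, threshold=2.5, excluded=("zeta", "gamma")):
    """Keep IDs of cells with high smedwi-1 that are not in an excluded subcluster.

    `threshold` and the `excluded` labels are illustrative assumptions.
    """
    kept = []
    for cell in cells:
        expr = log_normalize(cell["smedwi1_umis"], cell["total_umis"])
        if expr >= threshold and cell["subcluster"] not in excluded:
            kept.append(cell["id"])
    return kept


# Hypothetical toy data: one retained cell, one zeta-neoblast, one low expresser.
cells = [
    {"id": "c1", "smedwi1_umis": 30, "total_umis": 5000, "subcluster": "2"},
    {"id": "c2", "smedwi1_umis": 30, "total_umis": 5000, "subcluster": "zeta"},
    {"id": "c3", "smedwi1_umis": 1, "total_umis": 5000, "subcluster": "9"},
]
selected = select_candidate_neoblasts(cells)
print(selected)
```

The retained cells would then be reclustered on their own, which is the step that exposes clusters lacking specifically enriched markers.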

When all cells were clustered together, numerous
smedwi-1+ cells were present regionally
within each of the other eight major planarian
tissue clusters (Fig. 1C). We reasoned that these
smedwi-1+ cells could represent progenitors for
the cell types within each associated tissue cluster.
We therefore examined these smedwi-1+ cells
after taking each tissue class in isolation and
subclustering the data.

The planarian epidermis contains ciliated and 
nonciliated cells as well as dorsal-ventral bound- 
ary epidermis (10, 28, 31), and the lineage from 
ζ-neoblasts to epidermal cells is well characterized
(31-33) (Fig. 2G). SCS reveals gene expression
transitions during neoblast epidermal differentiation
(31); subclustering 11,021 epidermal lineage
cells (Fig. 1C) produced subclusters associated
with each epidermal lineage stage (Fig. 2H and
fig. S10, A and B). Plotting gene expression onto
this t-SNE map showed a continuous progression
from ζ-neoblast to differentiated cells (Fig. 2I and
fig. S10C). 

The gene dd_554 [SmedASXL_059179 in (34)] 
is expressed in candidate pharynx progenitors 
(34) (smedwi-1+ cells at the pharynx base) and in
smedwi-1− cells within the pharynx (34) (Fig. 2J
and fig. S11). Subclustering the 1083 nonmuscle,
non-neuronal pharynx cluster cells (Fig. 1C)
revealed that smedwi-1+ cells sequenced from
non-pharynx midbody tissue clustered with pharynx
cells, despite not being part of the pharynx itself
(Fig. 2, K and L). Because the pharynx lacks
neoblasts, pharynx-specialized neoblasts must be
outside of the pharynx. This clustering of neoblasts
with pharynx cells clearly demonstrates
the ability of SCS data clustering to associate
lineage precursors with differentiated cells.
Similarly, many dd_554+ cells sequenced from
outside of the pharynx clustered with pharynx cells
(Fig. 2, K and L). Plotting smedwi-1/dd_554
expression onto pharyngeal subclusters revealed a
progression from smedwi-1+ cells isolated outside
the pharynx to dd_554+ cells isolated outside the
pharynx to dd_554+ cells isolated inside the pharynx
to pharyngeal cells (Fig. 2, K and L). These
epidermis and pharynx examples demonstrate 
how precursor stages within cell lineages can 
be identified from subclustering cells within a 
major tissue class. Because planarians constantly 
generate new differentiated cells for essentially 
all tissue types (17, 20), transcriptomes for lin- 
eage precursors for essentially every cell type in 
the body could in principle be studied with this 
approach. 

Cell lineages for many planarian cell types are
largely uncharacterized. After tissue type
subclustering, smedwi-1+ cells were present with
locally high expression in resultant t-SNE plots;
smedwi-1 expression level gradually declined in
cells across subclusters (Fig. 2, M and N). These
smedwi-1+ cells, similar to the epidermis and
pharynx cases, could represent transition states
between pluripotent neoblasts and differentiated
cells for the various cell types of the protonephridia,
intestine, muscle, nervous system, parenchymal,
and cathepsin+ classes (Fig. 2, M and N). The
smedwi-1+ cells found within subclusters of the
major tissue type classes generally displayed
enriched expression of at least one transcription
factor. For example, smedwi-1 expression was
high within cells at the center of the parenchymal
cell t-SNE plot and displayed a graded decrease
projecting in all directions into seven major
parenchymal subclusters (Fig. 2N). Each projection
was associated with enriched expression of one
or more distinct transcription factors, identifying
candidate transcription factors associated with
the specification of different parenchymal cell
types (Fig. 2O).


Subclustering cells by tissue type 
uncovers rare cell types 


The protonephridia, the planarian excretory and 
osmoregulatory system, contains flame cells for 
filtering fluids, proximal and distal tubule cells, 
and a collecting duct (29, 35, 36). The proto- 
nephridia is a model tissue for studying organ 
regeneration and the evolution of kidney-like 
excretory systems. Subclustering of 890 proto- 
nephridia cells (Fig. 1C) identified each known 
protonephridia cell type as a separate subclus- 
ter, revealing the complete transcriptomes of 
these cells (Fig. 3A and fig. S12, A to C). Fur- 
thermore, two protonephridia subclusters with 
smedwi-1+ cells were identified (Fig. 2M). One
was enriched in flame cell gene expression (e.g., 
dd_2920) and the other in a proximal tubule 
marker (dd_10830), which suggests that they 
might be flame and tubule cell precursors, re- 
spectively (fig. S12, D and E). 

Less is known regarding the full complement 
of cell types in other planarian tissues. Ultra- 
structural studies suggested that the planarian 
intestine contains two cell types: absorptive 
enterocytes and secretory goblet cells (37, 38). 
Subclustering of 3025 intestinal cells (Fig. 1C) 
revealed three distinct cell populations (clusters 4, 
5, and 8) (Fig. 3B and fig. S13, A and B).

Fig. 2. Subclustering identifies neoblast subpopulations. (A) t-SNE
representation of 22 clusters generated from subclustering cells with
smedwi-1 expression ≥ 2.5 [ln(UMI-per-10,000 + 1)]. Identity of
numbered clusters unknown. PP, parenchymal; PN, protonephridia.
Intestine cluster is indicated by lower expression of smedwi-1 and
enriched gata4/5/6 and hnf-4 expression. (B) smedwi-1+ t-SNE plots
colored by prox-1 and zfp-1 expression. (C) Double FISH image for
dd_10988 and smedwi-1. Yellow arrows highlight coexpression; white
arrows denote absence of coexpression. (D) smedwi-1+ t-SNE plots
colored by expression of differentiated tissue-enriched genes. Arrows
indicate gene expression sites. (E) Left: smedwi-1+ t-SNE plots colored
by gata4/5/6 and hnf-4 expression. Right: All cluster t-SNE plots
colored by gata4/5/6 and hnf-4 expression. C, cathepsin+ cells; I, intestine;
γ-Nb, γ-neoblasts. (F) smedwi-1+ t-SNE plots colored by ETS1 (dd2092)
and FOXF1 (dd6910) expression. (G) Epidermal cell maturation stages.
(H) t-SNE representation of epidermal subclusters. FISH images labeled
by their associated cluster(s) are shown. (I) Epidermal t-SNE plots
colored by epidermal lineage marker expression from (G). (J) dd_554
FISH. (K) Pharynx t-SNE plots colored by smedwi-1 and dd_554
expression. (L) Pharynx t-SNE plot colored by the body region from
which each cell was isolated. (M and N) t-SNE plots, colored by smedwi-1
expression, generated by subclustering cells identified as (M)
protonephridia, intestine, muscle, cathepsin+, neural, and (N) parenchymal.
(O) Parenchymal t-SNE plots colored by expression of eight transcription
factor-encoding genes enriched in (N). Arrows indicate gene expression
sites. Scale bars, 50 μm (C), 200 μm [(H) and (J)].

FISH with subcluster-enriched markers (table S2)
revealed distinct intestine components. Cluster 4
represented an inner intestine cell layer (Fig. 3C) 
and was enriched for absorptive enterocyte mar- 
kers (39). Cluster 8 cells were largely present 
within the primary intestine branches, resem- 
bling the pattern of goblet cells (40). A third 
group (cluster 5) represented an outer intestine 
cell layer and displayed a set of enriched genes 
different from that of clusters 4 and 8 (Fig. 3C 
and table S2). In addition to these three main in- 
testine components, clusters representing putative 
transition states were also identified. Clusters 1, 3, 
and 6 included many smedwi-1+ cells (Fig. 2M).
Genes with enriched expression in clusters 0 and
7 displayed expression spanning into the enterocyte
cluster (cluster 4), suggesting these might
be enterocyte transition states (fig. S14A). Genes
with enriched expression in clusters 2 and 3
displayed expression spanning into the outer intestine
cluster (cluster 5) and might reflect transition
or variant states of these cells (fig. S14B). The
Monocle toolkit can be used to predict cellular
transitions in lineages (41) and was used to
build single-cell trajectories for the enterocyte
and outer intestine cell lineages, closely
recapitulating the candidate transition states identified
by Seurat (Fig. 3D, fig. S15, and table S3).
Several transcription factors required for the 
specification of various planarian cell types have 
been identified with RNA interference (RNAi) 
and gene expression studies. Because of constant 
tissue turnover, RNAi of transcription factor- 
encoding genes expressed in specific classes of 
specialized neoblasts in adult planarians can 
lead to steady depletion of the cell type generated 
by that specialized neoblast class (29, 42, 43). The 
transcriptomes identified here generate a re- 
source of enriched gene expression for different 
cell types, including transcription factor-encoding 
genes. Accordingly, inhibition of the transcription 
factor-encoding PTF1A (dd6869) gene, which
had enriched expression in candidate transition
states for the outer intestine cluster, strongly 
reduced this cell population while not affecting 
absorptive enterocytes of the intestine (Fig. 3E). 

The nervous system displays by far the greatest 
known cell type composition complexity of the 
major planarian tissues. By subclustering 11,907 
neuronal cells (Fig. 1C), we identified 61 distinct 
subclusters representing a diversity of cell types 
and states (Fig. 4A and fig. S16A). Twelve sub- 
clusters had high smedwi-1 expression, which 
suggested that they represent neuronal precursors
(fig. S16B). Cluster 10 contained cells of
the brain branches, as determined by expression 
of gpas (44) and pds (45) (Fig. 4B and fig. S16C). 
Three subclusters (clusters 3, 7, and 8) were defined 
by expression of pc2 (encoding a neuropeptide- 
processing proprotein convertase) as well as an 
assortment of markers for rare neuron classes 
in the cephalic ganglia and ventral nerve cords 
(Fig. 4C and fig. S16, D and E).

Fig. 3. Subclustering of tissues reveals transcriptomes for known
and novel cell populations. (A) t-SNE representation of the
protonephridial subcluster. FISH images are labeled by their associated
cluster. (B) t-SNE representation of intestinal subclusters. (C) Double
FISH images of genes enriched in separate intestinal subclusters.
Numbers indicate the associated subcluster for each marker. (D) Top:
Cell trajectory of enterocyte and outer intestinal cell lineages produced
by Monocle. Cells are colored by identity. Bottom: Heat map of branch
dependent genes (q value < 10^[exponent illegible]) across cells plotted
in pseudo-time (41). Cells, columns; genes, rows. Beginning of
pseudo-time is at center of heat map. "Cl." annotation indicates a
log-fold enrichment ≥ 1 of the gene in that intestine Seurat cluster.
(E) Top left: Intestine t-SNE plot colored by expression of PTF1A
(dd6869). Top right: Illustration of cutting scheme used to generate
fragments. Bottom: dd_115 and dd_75 FISH of control and PTF1A (dd6869)
RNAi animals. Animals were cut and fixed 23 days after the start of
double-stranded RNA (dsRNA) feedings. Scale bars: whole-animal/fragment
images, 200 μm; insets, 50 μm.

We also sequenced
an additional 7766 cells from the brain region to
expand the number of cells in these clusters (fig. 
S16, F to H). In addition to these large clusters, 
there existed a number of smaller, compact, and 
well-separated subclusters. These could be further 
divided into ciliated and nonciliated neurons 
according to the expression of rootletin (dd6573), 
which encodes a ciliary rootlet component (Fig. 
4D and fig. S17A). Because of further heterogeneity
within these clusters (e.g., opsin+ presumptive
photoreceptors were present together, but
not as a separate cluster), data from these two
cell sets (ciliated, not ciliated) were each taken in
isolation for further subclustering. This yielded
37 nonciliated neuron subclusters (Fig. 4E and 
figs. S17B, S18, A and B, and S19 to S21) and 25 
putatively ciliated neuron subclusters (Fig. 4F and 
figs. S17B, S22, A and B, and S23B). We assessed 
the localization of cells associated with 46 of
these 62 subclusters by FISH using subcluster-
specific markers. The observed cell types had a 
wide range of patterns including rare cell types 
such as photoreceptor neurons (Fig. 4, E and F, 
and figs. S18 to S23). Many genes had enriched 
expression in multiple clusters; the distribution 
of neural cell types they represented was defined
by a combinatorial set of markers (figs.
S18 to S23). A number of identified cell types
from different subclusters displayed similar 
localization patterns. However, FISH demon- 
strated no overlap in subcluster-specific mark- 
ers, consistent with the SCS data (Fig. 4G). For 
several neural subtypes, we found smedwi-1+ candidate
precursor cells. Four nonciliated neuron
subclusters (subclusters 1, 2, 4, and 12) and a
single ciliated neuron subcluster (subcluster 1)
were enriched in smedwi-1 expression (fig. S24A).
Nonciliated neuron subcluster 4 also expressed
gata4/5/6, as did six smedwi-1− clusters (clusters
14, 16/33, 24, 26, and 32) that radiated out from
central smedwi-1+ cells, raising the possibility
that these smedwi-1+ cells constitute precursors
for these populations (fig. S24B).

Fig. 4. Subclustering of neural cells reveals known and novel cell
populations. (A) t-SNE representation of the neural subcluster. (B and C)
Top: t-SNE plots colored by expression of gpas (B) and pc-2 (C). Bottom:
FISH for gpas (B) and pc-2 (C) labeled with the associated neural
subcluster. (D) t-SNE plot in (A) overlaid with outlines indicating the
ascribed identity of each subcluster as ciliated or nonciliated.
(E and F) t-SNE representation of subclustered cells identified in (D)
as nonciliated (E) or ciliated (F). (G) Double FISH images of three
sets of nonciliated neuron genes enriched in separate subclusters. Numbers
indicate the associated nonciliated neuron subcluster(s) for each marker.
Scale bars: whole-animal images, 200 μm; insets, 50 μm.

The pharynx is a muscular tube used for feed- 
ing and defecation (10). It is contained within an 
epithelial cavity and connects to the intestine at 
its anterior end via an esophagus. Pharyngeal 
muscle cells and pharyngeal neurons clustered 
together with the other muscle cells and neurons 
of the body (figs. S25A and S26A). Other
pharynx-associated cells, including cells from isolated
pharynges and surrounding tissue, constituted
the other major pharynx clusters. These non-neural,
nonmuscle pharynx and pharynx-associated cells
(Fig. 1C, n = 1083 cells) were subclustered, and
FISH was performed on cluster-enriched markers
(Fig. 5A and fig. S25, B and C). Subclusters
included pharyngeal cavity epithelium cells (clusters 7
and 8), the epithelial pharynx lining (clusters 1 
and 5), the mouth and esophagus (cluster 9), cells 
near the pharynx opening (cluster 6), and cells 
that constitute the connection to the planarian 
body (cluster 4). FISH confirmed nonoverlapping 
expression patterns for markers of tested separate 
cell populations (Fig. 5B). 

Planarian muscle expresses collagen in addi- 
tion to canonical muscle genes such as troponin 
and tropomyosin (19). Muscle exists in a sub- 
epidermal body wall layer, in the pharynx, sur- 
rounding the intestine, and in a DV domain (46). 
Subclustering 5014 muscle cells (Fig. 1C) revealed 
seven smedwi-1+ candidate precursor subclusters
(clusters 0, 1, 3, 4, 5, 10, and 11) (Fig. 2M), as
well as subclusters containing body wall muscle
(cluster 7), pharyngeal muscle (clusters 2, 8, 9,
and 12) (fig. S26A), a population of muscle cells 
enriched around the intestine (cluster 6), and an 
unidentified population (cluster 13) (Fig. 5C and 
fig. S26, B and C). Markers for body wall muscle 
(cluster 7) and cluster 13 were expressed in non- 
overlapping cells by FISH (Fig. 5D). 

Whereas some molecular characterization 
existed for the seven broad planarian tissue 
classes previously mentioned, very little is known 
regarding the cellular composition of the two 
remaining classes. The parenchymal class (Fig. 1C) 
(24) was highly heterogeneous, with subcluster- 
ing of 2120 cells identifying many distinct cell 
populations (Fig. 5E and figs. S27, A and B, and 
S28B). In addition to eight smedwi-1+ putative
precursor subclusters (clusters 0, 1, 2, 3, 4, 6, 8, and
most of 9) (Fig. 2N), parenchymal cell subcluster- 
ing revealed 13 well-separated differentiated cell 
subclusters. FISH showed that each of these
differentiated cell populations was present as
scattered cells, presumably within a mesenchymal
tissue layer called the parenchyma that surrounds
major planarian organs (10). Previous
morphological studies determined that the parenchyma is
composed of multiple gland cells, neoblasts, and 
“fixed parenchymal cells” characterized through 
histological and electron microscopy studies as a 
likely phagocytic cell with long cellular processes 
filling most of the parenchymal space (10, 47, 48). 
Some identified parenchymal subclusters appeared
to be gland cells, displaying processes
extending to the epidermis, defining
transcriptomes for these cells. Candidate gland cell types
included two that were exclusively dorsal (clusters 
16 and 17), two exclusively lateral (clusters 10 and 
13, including marginal adhesive gland cells and an 
unknown cell population), four present both dor- 
sally and ventrally (clusters 7, 11, 14, and 15), and 
one present ventrally near the brain (cluster 19). 
Three subclusters (clusters 5, 12, and 18) contained 
cells with patterns similar to those of planarian 
neoblasts, but were not neoblasts. Finally, a single
subcluster contained large cells surrounding the
pharynx (small group of cluster 9 cells) that were
enriched for expression of previously identified
metalloprotease-encoding genes (49). Three pairs 
of parenchymal subclusters (six subclusters total) 
were confirmed to exist in nonoverlapping pop- 
ulations by FISH (Fig. 5F). 

The transcription factor-encoding gene nkx6-like
was expressed in a parenchymal cell population
marked by dd_515. Inhibition of nkx6-like
ablated dd_515 cells, while not affecting a
distinct, non-enriched parenchymal cell population
marked by dd_385 (Fig. 5G). These results fur- 
ther highlight the potential to use the data to 
ablate many specific cell types in the animal. 

The final major class of cells, the cathepsin+
group, contained 7034 cells (Fig. 1D). This group
of clusters contained recently described glia
and pigment cells (12, 13, 50). Subclustering of
cathepsin+ cells identified four subclusters
expressing smedwi-1 that represented putative
precursor cells (clusters 0, 1, 3, and 6) (Fig. 2M), a
glial subcluster (cluster 15), and two pigment 
cell populations (clusters 11 and 14), identifying 
transcriptomes for these cell types (Fig. 6A and 
figs. S29, A and B, and S30B). Eight cathepsin+
subclusters represented previously unidentified 
cell populations. FISH revealed striking, elab- 
orate morphologies for most of these cells, in- 
volving long processes and unique distributions 
(Fig. 6A and figs. S29B and S30B). Cells from 
subclusters 5 and 10 were spread throughout 
the planarian body, with long processes filling 
substantial parenchyma space. Subcluster 8 rep- 
resented cells specific to the pharynx. Subcluster 
9 cells were scattered throughout the animal. 
Subclusters 4 and 16 identified cells with dense 
aggregated foci of elaborate processes at scattered 
locations throughout the animal that lacked de- 
finitive positions—an unusual and unanticipated 
cell type distribution. FISH identified markers 
labeling cell bodies of these cells, revealing that 
the aggregates comprised many cells (Fig. 6B). 
Subclusters 12 and 13 also exhibited processes 
with visible cell bodies. Subcluster 12 cells were 
largely subepidermal. The most elaborate of 
these newly identified cells (subclusters 5 and 
10) were excluded from the intestine and brain, 
but had processes around the branches of the 
intestine and protonephridia and interspersed 
within the cephalic ganglia (Fig. 6, C to E, and 
fig. S31, A and B). FISH confirmed nonoverlap- 
ping expression patterns for two tested sub- 
clusters (fig. S31C). 

Subcluster 7 of the cathepsin+ group of cells
was enriched for genes whose expression
spanned into clusters 5 and 10 (fig. S32A).
Similarly, expression of cluster 2 marker genes
spanned into clusters 4 and 16 (fig. S32B). These
cells might reflect transition or variant states
of cells for clusters 5/10 and 4/16, respectively.
SMEDWI-1 protein perdures in neoblast progeny
after loss of smedwi-1 mRNA, allowing
detection of newly produced neoblast progeny (27).
MAP3K5 (dd4849)+ cells, which were predicted
to be cells transitioning from the
smedwi-1+ state in the cathepsin+ cell plot, were
SMEDWI-1+/smedwi-1−, supporting the
interpretation that these cells are progenitors in the
cathepsin+ cell lineage (fig. S32, C and D). The
Monocle toolkit was also used to build single- 
cell trajectories for these clusters, with data closely 
recapitulating the transition states identified by 
Seurat (Fig. 6F, fig. S33, and table S3). 


A near complete discovery of planarian 
cell type transcriptomes 


The number of cell types identified in this study 
vastly exceeded prior planarian SCS data (24). 
Within the neuronal subclusters, a 17-cell sub- 
cluster represented photoreceptor neurons (Fig. 
4E), which are present at ~100 total cells in a 
medium-sized (~2 to 3 mm) animal (20). There- 
fore, our data should have readily included 
unknown cell types as rare as photoreceptor neu- 
rons. Similarly, an average-sized planarian has 
~60 cintillo+ neurons, and our data included
10 cintillo+ neurons (fig. S34A). These cells were
grouped within a larger subcluster (cluster 3) of 
nonciliated neurons (fig. S34B), suggesting that 
even further subclustering of this “subcluster 3” 
could reveal additional distinct cell types. Indeed, 
cintillo+ cells emerged as a unique cluster from
such additional (fourth tier) subclustering of orig- 
inal data (fig. S34C and table S2). Esophagus cells, 
connecting pharynx to intestine, clustered with 
mouth cells in the pharynx subclustering data 
(fig. S34D). About 50 of these cells exist in an 
average-sized animal, and three such cells were 
present in the data (fig. S34, A and D). Several 
known rare cell types did not separate into in- 
dividual clusters, although most could still be 
identified in the data, which suggests that the 
data are largely saturated for rare cell types. These 
include anterior pole cells, which function as an 
anterior organizer (51-53); notum+ neurons in the
brain (54); and posterior pole cells (45), each of 
which are among the rarest known cell types in 
the animal, with only ~10 each present in an 
average animal (fig. S34A). Five anterior pole cells, 
10 notum+ neurons, and one posterior pole cell
were identified in the data (fig. S34, E to G). Similarly,
~25 ovo+ eye progenitors (42) and ~90 nanos+
germ cells (55) are present in an average animal 
(fig. S34A). Two eye progenitors and 19 germ 
cells were identified in the data (fig. S34, H and I). 
In addition to the asexual strain of S. mediterranea 
used in this study, a sexual strain of cross- 
fertilizing hermaphrodites exists. We sequenced 
8455 cells from this strain, adding sexual strain 
cells to this resource as well, including seven 
yolk cells and seven testes cells in addition to 
the 19 germ cells described above (fig. S35, A to D).
Further sequencing of sexual cell types could be
a target for future studies. Together, our data 
indicate that we have essentially reached satura- 
tion for determining the cell type transcriptomes 
of asexual planarians. 
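
The saturation argument can be made concrete with a back-of-envelope calculation. The sketch below assumes cells are sampled uniformly at random and independently, which is an idealization because dissociation efficiency differs between cell types. The total cell number per animal (set to 100,000 here), the round figure of 50,000 sequenced cells, and the function names are illustrative assumptions rather than values taken from the analysis.

```python
def expected_captures(cells_per_animal, total_cells_in_animal, cells_sequenced):
    """Expected number of sequenced cells of a type under uniform sampling."""
    return cells_sequenced * cells_per_animal / total_cells_in_animal


def prob_at_least_one(cells_per_animal, total_cells_in_animal, cells_sequenced):
    """Probability that at least one cell of the type is sequenced."""
    frequency = cells_per_animal / total_cells_in_animal
    return 1.0 - (1.0 - frequency) ** cells_sequenced


TOTAL_CELLS = 100_000   # assumed cells per animal (illustrative)
SEQUENCED = 50_000      # assumed number of cells sequenced (illustrative)

# Per-animal abundances quoted in the text for two rare cell types.
for name, per_animal in [("photoreceptor neurons", 100), ("pole cells", 10)]:
    exp = expected_captures(per_animal, TOTAL_CELLS, SEQUENCED)
    p = prob_at_least_one(per_animal, TOTAL_CELLS, SEQUENCED)
    print(f"{name}: expected {exp:.1f} cells, P(at least one) = {p:.4f}")
```

Under these assumptions even a cell type present at about 10 cells per animal is very likely to be sampled at least once, which is the intuition behind using known rare cell types as a benchmark for coverage.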


Discovery of novel patterning genes 

Planarians constitutively express dozens of genes 
associated with patterning (PCGs) in complex 
spatial patterns across body axes (18). PCGs are 
almost exclusively expressed in muscle (19). AP-axis
PCGs are well established, including with
muscle SCS (56). Muscle cells did not subcluster 
according to their anatomical positions (Fig. 7A 
and fig. S36, A to C). However, we reasoned that 
expression of known PCGs could ascribe locations 
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Fig. 5. Tissue subclustering identifies cell populations of poorly char- 
acterized tissues. (A) t-SNE representation of the pharynx subcluster. 
FISH images are labeled by their associated cluster(s). (B) Double FISH 
images of pharynx markers enriched in separate subclusters. Numbers 
indicate the associated pharynx subcluster(s) for each marker. (C) t-SNE 
representation of the muscle subcluster. (D) Double FISH images of two 
muscle markers enriched in separate subclusters. Numbers indicate 

the associated muscle subcluster for each marker. (E) t-SNE representa- 
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tion of the parenchymal subcluster. (F) Double FISH images of three sets 
of parenchymal markers enriched in separate subclusters. Numbers 
indicate the associated parenchymal subcluster for each marker. (G) Top 
left: Parenchymal t-SNE plot colored by expression of nkx6-like. Top right: 
Illustration of cutting scheme used to generate fragments. Bottom: 
dd_515 and dd_385 FISH of control and nkx6-like RNAi animals. Animal 
sections were cut and fixed 23 days after the start of dsRNA feedings. 
Scale bars: whole-animal/fragment images, 200 um; insets, 50 um. 
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Fig. 6. Tissue subclustering reveals a previously unidentified class of 
cells. (A) t-SNE representation of the cathepsin® cell subcluster. FISH 
images are labeled by their associated cluster(s). Images associated with 
subclusters 5/10 and 8 are single slices in the animal. All other images are 
maximum intensity projections. (B) Double FISH for two cathepsin” cell 
markers enriched in the same subclusters, 4 and 16. (C to E) FISH for 
dd_9 and mat (C), ChAT (D), and dd_7742 (E). (F) Top: Cell trajectory of 


to muscle cells in the data. Because of var- 
iability in the expression of any one PCG, mus- 
cle cell regional identity was determined on 
the basis of expression of at least two PCGs. 
For example, posterior muscle cells were iden- 
tified by coexpression of at least two of the 
four posterior PCGs wntilI-1, wntll-2, fz4-1, and 
wntP-2, yielding 163 cells (Fig. 7A and fig. S36D). 
Differential expression analysis using the algo- 
rithm SCDE (57) was performed on these 163 cells 
against the 4851 other muscle cells (Fig. 7A and 
table S4). Strikingly, nine of the differentially 
expressed genes were identified by Scimone et al.
(56) as posterior-enriched; eight of these were
within the top 26 genes identified by differential
expression analysis (Fig. 7B, hypergeometric P =
2.75 × 10^[exponent illegible]). A similar analysis on 837 anterior
muscle cells was also performed (fig. S36, A and
D, and table S4). Nine of the differentially
expressed genes were identified by Scimone et al.
(56); four of the genes were within the top 25
genes identified by SCDE (Fig. 7B, hypergeometric
P = 5.80 × 10^[exponent illegible]). We also applied this approach
to the less well studied ML axis. We identified 
62 lateral muscle cells, and FISH with 15 of the
top genes identified seven with lateral muscle
expression (Fig. 7C and fig. S36, B and D to G) 
(58, 59). We isolated 90 medial muscle cells, and 
the top-ranked gene displayed a striking thin 
stripe of expression down the dorsal midline 
(Fig. 7C and fig. S36, C, D, and H). Together, these 
results demonstrate the power of deep SCS for 
identifying regional gene expression, such as that 
involved in patterning, in adult animal tissues. 
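
The two computational ideas in this analysis, assigning regional identity by coexpression of at least two PCGs and testing the overlap with a published gene list, can be sketched as follows. This is an illustrative sketch under stated assumptions: the 0.5 expression cutoff follows the Fig. 7 legend, but the toy expression values, the gene-universe size, and the reference-list size passed to the test are invented, and the hypergeometric tail is computed directly rather than with the software used in the study.

```python
from math import comb

POSTERIOR_PCGS = ("wnt11-1", "wnt11-2", "fz4-1", "wntP-2")


def is_regional(cell_expr, pcgs=POSTERIOR_PCGS, cutoff=0.5, min_genes=2):
    """True if at least `min_genes` of the PCGs are expressed at or above `cutoff`."""
    n_positive = sum(1 for gene in pcgs if cell_expr.get(gene, 0.0) >= cutoff)
    return n_positive >= min_genes


def hypergeom_tail(k, N, K, n):
    """P(X >= k) when n genes are drawn from N genes, K of which are reference hits."""
    upper = min(K, n)
    favorable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favorable / comb(N, n)


# Hypothetical normalized expression values for three muscle cells.
cells = {
    "m1": {"wnt11-1": 1.2, "fz4-1": 0.9},
    "m2": {"wntP-2": 2.0},
    "m3": {"wnt11-2": 0.4, "wntP-2": 0.6, "fz4-1": 0.7},
}
posterior = [cell_id for cell_id, expr in cells.items() if is_regional(expr)]
print(posterior)

# Overlap test with invented sizes: 8 of the top 26 genes fall in a
# reference list of 40 genes, out of an assumed universe of 20,000 genes.
p_value = hypergeom_tail(k=8, N=20_000, K=40, n=26)
print(p_value)
```

Requiring two PCGs rather than one trades some sensitivity for robustness against dropout or noise in any single gene, which matches the rationale given in the text.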


Discussion 


RNA sequencing of >50,000 cells (in total, 
66,783 cells were sequenced) of the planarian 
S. mediterranea allowed the identification of 
transcriptomes for most to all cell types of an 
adult animal. This includes transcriptomes for 
cell types present as rarely as 10 cells in an animal
with 10^5 to 10^6 cells, which strongly suggests
that we have reached near-saturation. Sequenc- 
ing of different body regions and assessment 
of rare cell type coverage in an iterative process 
enabled us to reach this saturation level. Some 
cell types might escape detection by this tech- 
nique if they are exceptionally rare or hard to 
dissociate from the animal.

25 May 2018

dd_1831+ and dd_9+ cathepsin+ cell lineages produced by Monocle. Cells
are colored by identity. Bottom: Heat map of branch dependent genes
(q value < 10^[exponent illegible]) across cells plotted in pseudo-time
(41). Cells, columns; genes, rows. Beginning of pseudo-time is at center
of heat map. "Cl." annotation indicates a log-fold enrichment ≥ 1 of the
gene in that cathepsin+ cell Seurat cluster. Scale bars: whole-animal
images, 200 μm; insets and (B) to (E), 50 μm.

Our data did indicate
that some cell types were preferentially recovered
according to the abundance of that cell type 
by FISH, whereas others were less represented 
(fig. S37, A and B). In particular, prog-1+
epidermal progenitor cells were highly
overrepresented in the data relative to their prevalence
in the animal, perhaps because their small size
made their isolation easier (fig. S37A). Excluding
prog-1+ cells, most other cell types analyzed were
represented similarly to their relative abundance 
in the animal (fig. S37B). Regardless of differ- 
ences in ease of dissociation between cell types, 
we recovered data from all known cell types as- 
sessed. Not every known rare cell type emerged 
as a separable cluster; that is, these cells were 
sometimes embedded within a larger cluster. In 
some instances, further rounds of subclustering 
based on such knowledge resulted in splitting of 
subclusters into additional subclusters. There- 
fore, further subclustering analyses and even 
deeper sequencing will likely continue to en- 
hance the capacity to computationally isolate 
rare cell types from other clusters. Nonetheless, 
the transcriptomes for such rare cell types are 
present in our data and can be studied by

Fig. 7. Identification of new regionally expressed

(A) Top: t-SNE plot colored by muscle cells positive for expression ≥ 0.5
[ln(UMI-per-10,000 + 1)] of two of the four posterior PCGs wnt11-1,
wnt11-2, fz-4, and wntP-2. Pink, positive cells; gray, negative cells. Bottom:
Transcriptomes for posterior muscle cells were compared to all other 
muscle cells by SCDE. (B) List of differentially expressed genes in 
posterior and anterior muscle cells that were identified in Scimone et al. 


searching for the desired cells. Another challenge 
inherent in assessing saturation of cell type se- 
quencing is ambiguity with the term cell type. 
Gene expression heterogeneity exists within well- 
defined clusters and could reflect differences at- 
tributable to technical sampling error, cell type 
state differences, or robust differences in bio- 
logical function. Further in vivo morphological 
and functional studies with identified cell clus- 
ters, further computational analyses, and even 
more sequencing data can continue to refine 
the knowledge of biologically important cell 
type differences. 

Cell types have been previously identified 
largely through morphological descriptions and 
perhaps a few marker genes. Determining cell 
type transcriptomes with large-scale SCS is a 
powerful new approach to defining the cell type 
constitution of a tissue, an organ, or even a complete
animal. In our study, we identified a large
number of previously uncharacterized planarian
cell populations across multiple tissues. This included
multiple cell populations (in the cathepsin+
group) previously undescribed at the molecular 
level. One cell population, defined by dd_9 ex- 
pression, had long processes filling parenchy- 
mal space and surrounding, but excluded from, 
other planarian tissues. This pattern is reminiscent 
of “fixed parenchymal cells,” a largely unchar- 
acterized cell population described by histology 
and electron microscopy (EM) (48). Previous EM 
work suggested that fixed parenchymal cells are 
likely phagocytic, with clearly observed lysosomes; 
dd_9+ cells highly expressed genes encoding a
variety of digestive enzymes and endocytosis
proteins, providing further support for this
hypothesis (table S2). The biology of these
cathepsin+ cells and all the other diverse cell
types identified in this work can now be studied 
in depth using identified transcriptomes and

(56). Rank indicates the rank of the gene in our analysis. (C) FISH
images of one lateral and one medial expressed gene ranked highly in
this analysis (59). Number indicates gene rank in the list generated
by SCDE. Scale bars: whole-mount images, 200 μm; insets, 50 μm.
(D) Illustration highlighting the capacity of the data set to identify 
almost all cell types in the planarian, as well as specialized neoblast 
progenitors and novel patterning information from the adult animal. 


the tools of planarian biology research. For in- 
stance, we show for two case studies above that 
RNAi of a gene encoding a transcription factor 
with enriched expression in a candidate cell lin- 
eage leads to ablation of the predicted differ- 
entiated cell. 

Generating transcriptomes for most to all cell 
types in an animal will be invaluable for study- 
ing gene function and the biology and evolution 
of a large range of important cell types. Because 
of the phylogenetic position of planarians within the Spiralian
superphylum (60), major cell types found across 
diverse bilaterians (e.g., shared between humans 
and Drosophila, C. elegans, molluscs, annelids, and/ 
or other bilaterians) should have been present in 
the last common ancestor of planarians and hu- 
mans. As such, studying the transcriptomes and 
associated genes with cell type-enriched expression 
in this data set can allow characterization of the 
gene function underlying the biology of these cells.

Planarian biology presents many features that
made this organism attractive for comprehensive 
SCS. Planarians are a model for studying nu- 
merous important problems in regeneration, 
stem cell biology, patterning, and evolution. At a 
single time point—the adult—there exist progen- 
itors for essentially all cell types and the pat- 
terning information for guiding new cell type 
production. We identified the transcriptomes 
of numerous candidate transition states in lin- 
eages from pluripotent stem cell to diverse dif- 
ferentiated cell types. Furthermore, we used the 
data to identify novel regionally expressed genes 
in planarian muscle (the site of patterning gene 
expression). Together, these results illustrate 
the capacity of our data set to define cell type 
transcriptomes, identify lineage transition states, 
and ascertain novel patterning information, all 
from a single time point (Fig. 7D). We propose 
that this atlas-like data set of cell type transcrip- 
tomes, much like the genome sequence of an ani- 
mal, can serve as a resource fueling an immense 
amount of research, not only in planarians but 
in other bilaterians with similar cell types. To 
facilitate such study, we developed an online 
resource that generates cluster expression data, 
for any gene, across all clusters and subclusters 
(digiworm.wi.mit.edu). Case study model orga- 
nisms have proved to be valuable testing grounds 
for developing approaches to complete genome 
sequencing; these planarian SCS data demon- 
strate an approach to near-to-complete cell type 
transcriptome identification that could be applied 
broadly to diverse organisms with varying de- 
grees of information about cell type composition. 
The remarkable ability of single-cell RNA se- 
quencing to reach nearly complete saturation of 
transcriptome identification for all the cell types 
of an animal represents a powerful approach for 
describing the anatomy of complete organisms 
at the molecular level.


SINGLE-CELL ANALYSIS


Cell type atlas and lineage tree 
of a whole complex animal by 
single-cell transcriptomics 


Mireya Plass,* Jordi Solana,* F. Alexander Wolf, Salah Ayoub, Aristotelis Misios, 
Petar Glazar, Benedikt Obermayer, Fabian J. Theis, Christine Kocks, Nikolaus Rajewsky†


INTRODUCTION: Understanding the differentiation
of stem cells into the vast number of
cell types that form the human body is a
central problem of basic and medical science. 
The recent advances in single-cell sequencing 
techniques now make it possible to capture 
the transcriptomes of thousands of cells in a 
fast and cost-effective manner, opening a new 
way to study the cell composition of organs, 
tissues, and developmental stages. Yet, single- 
cell transcriptomics per se just provides a snap- 
shot of cellular dynamics and transient cell 
populations. Computational algorithms have 
emerged that infer a pseudotemporal ordering 
of cells based on comparison of their tran- 
scriptomic profiles, allowing new insights into 
stem cell biology and tissue differentiation. 
However, these algorithms were designed for 
relatively simple scenarios, such as the differentiation
of cells belonging to a specific lineage
or the lineage relationships among cells
from a particular tissue, and cannot evaluate 
all possible cellular differentiation trajectories 
in complex animals. To this end, we use single- 
cell transcriptomic approaches to improve our 
molecular understanding of how stem cells dif- 
ferentiate into the set of cell types that make an 
entire complex adult animal. 


RATIONALE: Freshwater planarians such as 
Schmidtea mediterranea offer a unique oppor- 
tunity to approach this question. These animals 
are immortal and constantly renew and regen- 
erate all tissues owing to the presence of a large 
pool of pluripotent stem cells that continuously 
differentiate into all mature cell types. There- 
fore, we reasoned that an unbiased single-cell 
transcriptomic approach should allow us to 
capture not only terminally differentiated cell 
types but also intermediate cellular states, possibly
enabling cell lineage reconstruction of the
whole animal from transcriptomic data.


A lineage tree for complex animals from single-cell transcriptomics. Planarians are
multicellular organisms. They contain adult pluripotent stem cells that continuously renew all
tissues and differentiate into all adult cell types. Using single-cell transcriptomics, we characterized
all major mature cell types and many intermediate cellular states. We then derived a lineage
tree describing planarian stem cell differentiation into all mature cell types of the animal.


Read the full article at http://dx.doi.org/10.1126/science.aaq1723

RESULTS: We performed massively parallel
single-cell transcriptomics profiling of thousands
of cells from adult planarians. At the
molecular level, we identified and characterized
dozens of cell types, including stem cells,
progenitors, and terminally differentiated cells.
We then applied a new computational algorithm,
partition-based graph abstraction (PAGA), which
can predict a lineage tree for the whole animal
in an unbiased way. By combining the predictions
from PAGA with several independent
lines of evidence, including single-cell transcriptome
data from purified stem cells and stem cell-depleted
animals, analysis of gene expression dynamics,
and a method called
velocyto that predicts future gene expression
from mRNA metabolism, we produced a con- 
solidated lineage tree that included all identi- 
fied cell types rooted to a single stem cell group. 
We used this information to identify gene sets 
co-regulated during the differentiation of many 
specific cell types. To show the power of our ap- 
proach, we applied single-cell transcriptomics to 
regenerating planarians and characterized how 
each cell type in the adult planarian body dy- 
namically responds to regenerative body remod- 
eling at the transcriptomic and cellular levels. 
Our results highlight that some cell types that 
had been previously overlooked in molecular 
studies quickly decrease their abundance, in- 
dicating that they may serve as an energy res- 
ervoir that fuels the regeneration process. 


CONCLUSION: We have shown that it is pos- 
sible to use single-cell transcriptomics to (i) build 
a cell atlas of an adult animal, (ii) reconstruct the
lineage relationships of its cells in an unbiased 
way, and (iii) identify gene sets which likely 
contain genes that are involved in programming 
the lineage tree. Moreover, we demonstrated 
how single-cell transcriptomics can be used to 
study complex and dynamic cellular processes 
such as regeneration. Notably, our approach is 
applicable to other model and non-model or- 
ganisms, assuming that their differentiation pro- 
cesses are sampled with sufficient time resolution. 
To foster future studies, we provide a detailed 
tutorial on the application of our approach, and
we make our data available through an interactive
web interface. This study opens the door
to powerful approaches for understanding molecular
mechanisms of development and regeneration
in animals.


Cell type atlas and lineage tree of
a whole complex animal by
single-cell transcriptomics

Flatworms of the species Schmidtea mediterranea are immortal: adult animals contain a
large pool of pluripotent stem cells that continuously differentiate into all adult cell types. 
Therefore, single-cell transcriptome profiling of adult animals should reveal mature and 
progenitor cells. By combining perturbation experiments, gene expression analysis, a 
computational method that predicts future cell states from transcriptional changes, and a 
lineage reconstruction method, we placed all major cell types onto a single lineage tree that 
connects all cells to a single stem cell compartment. We characterized gene expression changes 
during differentiation and discovered cell types important for regeneration. Our results 
demonstrate the importance of single-cell transcriptome analysis for mapping and 
reconstructing fundamental processes of developmental and regenerative biology at
high resolution.


Understanding differentiation from stem
cells into the different cell types that make
up the human body is a central problem of
basic and medical science. Although numerous
mechanisms of cellular differentiation
have been identified and many cell types have
been characterized, it will require a huge coordinated
undertaking to systematically map all
human cell types and cellular differentiation
states (1). Owing to the advances in single-cell
transcriptomics, it has already been possible to 
study the cell type composition of mammalian 
organs and tissues (2-6), as well as development 
stages (7, 8). However, single-cell transcriptomics 
provides just a snapshot of the dynamics of the
cell populations unless cells can be traced or tagged 
experimentally (9-12). Thus, reconstructing cell 
lineages from stem cells to differentiated cells 
remains a challenge. Recently, algorithms to order 
developmental states and compute lineage trees 
based on comparing single-cell transcriptomes 
have made considerable progress (13-15) and 
have revealed insights into stem cell biology (16) 
and tissue differentiation (17-20). However, these
algorithms have been developed for the study of
differentiation in specific cell lineages or tissues 
and are not suitable to reconstruct all the cell 
differentiation trajectories present in complex 
animals. 

Given these problems, can single-cell tran- 
scriptomic approaches improve our molecular 
understanding of how stem cells differentiate 
into all the cell types of an entire complex adult 
animal? Freshwater planarians such as Schmidtea 
mediterranea offer a unique opportunity to an- 
swer this question. Planarians are immortal, as 
they contain as adults a large pool of pluripotent 
stem cells (neoblasts) that continuously differenti- 
ate to all mature cell types to turn over all tissues 
(21). Hence, all cell differentiation pathways are 
constantly active in adult individuals. We there- 
fore reasoned that an unbiased single-cell tran- 
scriptomics approach should yield terminally 
differentiated cell types as well as many inter- 
mediate cellular states, making planarians an ideal 
model system to attempt the lineage reconstruc- 
tion of a whole animal. 

Here, we performed highly parallel droplet-based 
single-cell transcriptomics, Drop-seq (3), to char- 
acterize planarian cell types. We molecularly 
characterized dozens of cell types and uncovered 
many new ones. By applying a newly developed 
algorithm, partition-based graph abstraction 
(PAGA), which reconciles the principles of clus- 
tering and pseudotemporal ordering (22), and 
combining it with independent computational 
and experimental approaches, we derive a con- 
solidated lineage tree that includes all identified 
cell types rooted to a single stem cell cluster. 
Along this tree, we identify 48 gene sets that are 


co-regulated during the differentiation of specific 
cell types. Finally, we used single-cell transcrip- 
tomics to characterize cellular processes that 
happen during regeneration. Our results reveal a 
strong depletion of newly characterized cell types, 
suggesting that these cells are used as an energy 
source for regeneration. 


A high-resolution cell type atlas 
for planaria 


To comprehensively characterize different cell types 
and progenitor stages present in adult planar- 
ians, we performed genome-wide expression profil- 
ing in individual cells using nanoliter droplets 
(Drop-seq) (3) of cells isolated from whole adult 
animals. These cells were obtained, after dissoci- 
ation, by fluorescence-activated cell sorting (FACS), 
which separated intact live cells from dead cells 
and enucleated cellular debris (Fig. 1A). From 11 
independent experiments, we captured a total of 
21,612 cells. We detected on average 494 genes 
and ~970 transcripts [identified by using “unique 
molecular identifiers” (UMIs)] per cell. The in- 
dividual data sets correspond to five wild-type 
samples (10,866 cells), two RNA interference 
(RNAi) samples (3314 cells), a high-DNA-content
G2/M population corresponding to cycling
planarian stem cells (typically defined as x-ray-sensitive
“X1 cells”; 981 cells) (23, 24), and three
wild-type regeneration samples (6451 cells; table 
S1). Sequencing depth was comparable across 
samples (fig. S1A). Biological replicates showed 
highly correlated gene expression profiles (fig. 
S1B). In addition, all samples showed high cor- 
relation with published RNA-sequencing (RNA-seq) 
data from equivalent bulk cell populations (24-27) 
(fig. S1C). We pooled and analyzed all single-cell
data sets together using Seurat (3). Eight of 11
samples were fixed with methanol (28) or frozen
with dimethyl sulfoxide (DMSO) (29) to facilitate
sample handling. To assess batch effects, we
compared the overall quality across wild-type
samples. Cells from each batch were distributed
similarly on the t-distributed stochastic neighbor
embedding (tSNE) (fig. S1D), which resulted in
comparable proportions of cells per cluster (fig.
S1E and table S2). Although we observed a mild
bias in gene expression due to the preservation
procedure of the samples, clustering was not
affected (fig. S1F). However, we observed differences
in the number of UMIs per cell across
clusters (fig. S1G). Together, these analyses confirmed
that sample preparation did not compromise
data quality or introduce bias. Therefore, we
clustered the expression profiles of the individual 
cells from all samples together using Seurat (3). In 
total, we identified 51 cell clusters (Fig. 1B). 
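
The sample breakdown given above can be checked with a few lines of arithmetic. The following is a minimal sketch that uses only the cell and sample counts quoted in the text; the dictionary keys and variable names are illustrative labels, not identifiers from the study.

```python
# Cell counts per sample group, as quoted in the text (table S1 has the per-sample details).
cells_per_group = {
    "wild-type": 10_866,
    "RNAi": 3_314,
    "X1": 981,
    "regeneration": 6_451,
}
# Number of independent samples per group, as quoted in the text.
samples_per_group = {"wild-type": 5, "RNAi": 2, "X1": 1, "regeneration": 3}

total_cells = sum(cells_per_group.values())
total_samples = sum(samples_per_group.values())

# Share of the pooled data set contributed by each group.
fractions = {group: n / total_cells for group, n in cells_per_group.items()}

for group, frac in fractions.items():
    print(f"{group}: {cells_per_group[group]} cells ({frac:.1%})")
print(f"total: {total_cells} cells from {total_samples} samples")
```

The group totals add up to the 21,612 cells and 11 experiments stated above, with wild-type samples contributing about half of the pooled cells.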

We elucidated the cell type identity of clusters 
by examining marker genes and comparing them 
to those in previous studies (fig. S2, A and B) 
(supplementary note 1). The largest cluster and 
14 smaller clusters located in the center of the 
tSNE plot express combinations of well-known 
stem cell markers (fig. S3A), such as Smedwi-1, 
Smedtud-1, and bruli (fig. S3B). The remaining
clusters corresponded to the previously described 
neural, epidermal, secretory, muscle, gut, and
protonephridia cell types (fig. S2, A and B). However,
in each of these categories, we found several
distinct clusters (Fig. 1B) that express different
combinations of marker genes (table S3 and fig.
S4). This result suggests that our approach can
distinguish more cell types than previous studies.

Fig. 1. Cell type atlas by single-cell transcriptomics. (A) Experimental
workflow. (B) tSNE representation of the single-cell transcriptomics data with
clusters colored according to the expression of previously published marker
genes as follows: gray, neoblasts; orange, neuronal lineage; red, muscle; purple,
secretory; blue, epidermal lineages; pink, protonephridia; green, gut; magenta,
parenchymal lineages. (C) Proportions of cell types identified by Baguñà and
Romero by microscopy (left) and as identified by tallying up our annotated
Drop-seq clusters (right). The outer ring shows the proportion of each
individual cluster, which includes neoblasts, epidermal (epidermal and rhabdite),
parenchymal (fixed parenchymal), pigment, neuronal (nerve), muscular,
gut (gastrodermal and goblet), secretory (acidophilic and basophilic), and
protonephridia (flame) cells. We did not find “striped” cells in our data set.
Overall, we find many subtypes for each of the original cell types. (D to F) tSNE
plots (upper panels) showing the expression of marker genes and their
expression patterns in adult animals with double in situ hybridizations on tissue
sections (lower panels). Nuclei in (D) and (F) were stained with Hoechst
and are shown in blue in the overlay. Scale bars: 100 μm. The color scale for
tSNE plots ranges from light gray (no expression) to blue (high expression).
[Panel labels: body muscle, general muscle, aqp+ parenchymal cells, psap+ parenchymal cells.]


Single-cell transcriptomics unveils 
previously uncharacterized cell types 


In the 1980s, Baguñà and Romero used microscopy
to morphologically characterize and count
all major cell types in S. mediterranea (30). We 
used this resource as a reference to validate the 
cell types identified by our Drop-seq data and 
cluster annotation. Even though the microscopy 
data are of a qualitative nature, we observed a 
strong correlation between them and our molecular,
unbiased Drop-seq annotation (Fig. 1C), suggest- 
ing that FACS sorting, cryopreservation or fixa- 
tion, and cell capture in nanodroplets did not 
influence cell type proportions. We validated the 
identity of several clusters by designing RNA 
probes targeting marker genes and performing 
in situ hybridizations, both whole mount and in 
histological sections (fig. S5). We could confirm 
major known cell types such as different types of 
neurons, muscle, protonephridia, epidermis, and 
secretory cells. We identified the two main cell 
types of the planarian gut: phagocytes (Fig. 1D, 
red) and goblet cells (Fig. 1D, green), and discovered
markers of planarian goblet cells, including
a gene without apparent homologs in other
phyla. We named this gene bruixot. We also distinguished
body and pharynx muscle (Fig. 1E).
General muscle markers colocalized with body
muscle markers throughout the entire body except
in the pharynx (Fig. 1E). Pharynx muscle
was characterized by the expression of laminin 
(31) (fig. S5). The protonephridia cluster (0.3% of 
our wild-type cells) contained the two main cell 
types of these organs, flame and tubular cells 
(32) (figs. S4 and S5). In some cases, cell clusters 
contain several similar subtypes that we can- 
not distinguish at this resolution. For instance, 
previously described markers of eye pigment 
cup cells and photoreceptor neurons (33) are ex- 
pressed in pigment and ChAT neurons 2 clus- 
ters, respectively, indicating that the former are 
subtypes of the latter (fig. S6). 

We also validated a recently discovered epi- 
dermis cell type, which marks the boundary be- 
tween the dorsal and ventral parts of planarians 
(fig. S5) (34). Additionally, we identified an epi- 
dermal-related pharynx cell type (fig. S5) and sev- 
eral parenchymal cell types previously undescribed 
molecularly (Fig. 1B and fig. S5). Among parenchymal
clusters, we found a diversity of nonoverlapping
cell types, including aqp+ and psap+
parenchymal cells (Fig. 1F), which probably col- 
lectively correspond to the previously described 
fixed parenchymal cells (30, 35), pigment cells 
(cluster 44) (36, 37), and glial cells (38, 39) (cluster 
47) (figs. S4 and S5). Altogether, these results show 
that we can identify known as well as unknown 
cell types using single-cell transcriptomics and 
measure their abundances in a reproducible way. 

To investigate the function of newly identified 
cell types, we used pathway and gene set over- 
dispersion analysis (PAGODA) (40) to identify 
variable gene sets with particular Gene Ontology 


(GO) terms annotated (fig. S7 and supplementary 
note 2). The clustering that emerges with these 
gene sets roughly recapitulates the one obtained 
with Seurat, showing the robustness of our clus- 
tering approach (fig. S7A). This analysis revealed 
that neoblasts and progenitors are functionally 
similar, both expressing gene signatures enriched 
for GO terms related to RNA processing. Addition- 
ally, parenchymal clusters showed enrichment for 
GO terms related to “lysosome,” “extracellular re- 
gion,” and hydrolytic enzymes and appear to share 
metabolic functions with gut cells (fig. S7B). 


Single-cell transcriptomics of purified 
stem cells and stem cell-depleted 
animals reveals stem, progenitor, and 
differentiated cell populations 


Fig. 2. Neoblast ablation and enrichment experiments show stem and
progenitor clusters. (A) tSNE plots showing the distribution of the
cells of an X1 FACS-sorted sample (red) and its whole-cell population control
(X1 control, magenta), and an h2b(RNAi) sample with its negative control
[gfp(RNAi), green]. X1 cells are enriched in the center of the plot, whereas
h2b(RNAi) cells are enriched in the periphery. (B) PCA considering
the expression level of neoblast marker genes and the log odds ratio of
the number of cells per cluster from h2b(RNAi) and X1 experiments
compared to wild-type and control samples separates neoblasts (gray),
progenitor clusters (yellow), and differentiated cell clusters (blue).
The location of these clusters is shown on the tSNE plot on the
right. (C) Gene expression correlation between bulk RNA-seq data
from FACS-sorted X1, X2, Xins populations, and whole worms
and the pooled clusters as defined in (B). Neoblasts show a stronger
correlation with X1, progenitors with X2, and differentiated cells with
Xins and whole worms.

The great diversity of cell types identified, which
included stem cells, differentiated cells, and presumably
many progenitor cells, offered a unique
opportunity for exploring stem cell differentiation
and lineage relationships between all cell
clusters. We focused on the X1 cell sample, which
is enriched in G2/M neoblasts (23, 24), and the
histone 2b (h2b) RNAi-treated whole-planaria
sample, in which stem and progenitor cell populations
are depleted (41). Cells from these data
sets showed a clear distribution pattern: X1 cells
were located in the middle of the tSNE plot (Fig.
2A, red dots), whereas h2b(RNAi)-resistant cells
were clearly enriched in the periphery (Fig. 2A,
blue dots). This distribution was specific and not
the result of batch effects, as evident from the
respective control samples [Fig. 2A, X1 control
and gfp(RNAi) samples]. Given that each data set
is enriched in particular cell populations, we
reasoned that they could be used to distinguish 
cells in varying differentiation states. We quanti- 
fied the fraction of cells per cluster from the X1 
and h2b(RNAi) samples and compared them to
wild-type and control samples. We performed a
principal component analysis (PCA) using these
cellular proportions as well as the mean expression
of the three top neoblast markers (Smedwi-1,
tub-α1, and h2b) in each cluster. The first two
principal components resulting from this anal- 
ysis separated clusters according to their gene 
expression profiles as neoblasts, progenitors, 
and differentiated cell clusters (Fig. 2B). Map- 
ping onto the tSNE revealed that progenitor 
cell clusters were located between differentiated 
cells and neoblast clusters (Fig. 2B). To corrob- 
orate the differentiation state of the cells in the 
different clusters, we pooled the cells in each 
group and correlated their gene expression pro- 
files to previously described FACS populations. 
Neoblast clusters best correlated with X1 populations,
corresponding to high-DNA-content G2/M
neoblasts; progenitor clusters correlated with
X2 populations, a mixture of G1/S neoblasts and
early progenitors; and differentiated cell clusters 
correlated with Xins samples, a pool of all dif- 
ferentiated cells (Fig. 2C). Altogether, our func- 
tional experiments reveal the stem, progenitor, 
or differentiated status of each cell cluster. 
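
The classification step described in this section can be illustrated with a small numerical sketch. The code below is not the authors' pipeline: the counts, marker values, cluster names, and the pseudocount of 0.5 are hypothetical choices made for illustration. It computes, for each cluster, a log odds ratio of cell membership in an enriched or depleted sample relative to a reference, combines it with a mean marker expression value, and projects the clusters with a plain SVD-based PCA.

```python
import numpy as np

def log_odds_ratio(k_sample, n_sample, k_ref, n_ref, pseudocount=0.5):
    """Log odds of a cell belonging to a cluster in a sample versus a reference.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for clusters that are empty in one of the samples.
    """
    odds_sample = (k_sample + pseudocount) / (n_sample - k_sample + pseudocount)
    odds_ref = (k_ref + pseudocount) / (n_ref - k_ref + pseudocount)
    return np.log(odds_sample / odds_ref)

def pca_scores(features, n_components=2):
    """PCA via SVD on column-standardized features (rows are clusters)."""
    z = (features - features.mean(axis=0)) / features.std(axis=0)
    u, s, _ = np.linalg.svd(z, full_matrices=False)
    return u[:, :n_components] * s[:n_components]

# Hypothetical toy data for three clusters (not values from the study).
cluster_names = ["neoblast-like", "progenitor-like", "differentiated-like"]
ref_counts = np.array([300, 100, 50])   # cells per cluster in a reference sample
x1_counts = np.array([600, 80, 5])      # cells per cluster in a stem-cell-enriched sample
h2b_counts = np.array([10, 40, 200])    # cells per cluster in a stem-cell-depleted sample
n_cells = 1000                          # total cells per sample in this toy example
marker_expr = np.array([3.0, 1.2, 0.1]) # mean neoblast marker expression per cluster

lor_x1 = log_odds_ratio(x1_counts, n_cells, ref_counts, n_cells)
lor_h2b = log_odds_ratio(h2b_counts, n_cells, ref_counts, n_cells)

features = np.column_stack([lor_x1, lor_h2b, marker_expr])
scores = pca_scores(features)

for name, (pc1, pc2) in zip(cluster_names, scores):
    print(f"{name}: PC1={pc1:+.2f}, PC2={pc2:+.2f}")
```

In this toy setting the stem-cell-like and differentiated-like clusters fall at opposite ends of the first principal component, with the progenitor-like cluster in between, which mirrors the kind of separation described for Fig. 2B.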


Computational lineage reconstruction 
predicts a single tree for all major 
planarian cell differentiation trajectories 


Existing methods to investigate cell differentia- 
tion using single-cell transcriptomics data were 
designed to study individual lineages or organs, 
allow few branching trajectories (13, 15, 18), and 
often require high sequencing depth and asso- 
ciated costs (16). To overcome these limitations, 
we developed the general framework of PAGA, 
which reconciles clustering and pseudotemporal 
ordering algorithms and allows the inference of 
complex cell trajectories and differentiation trees 
(22). Starting from the neighborhood graph of 
single cells, in which cells are represented as 
nodes, the algorithm quantifies the connectivity 
of cell clusters and generates a much simpler 
abstracted graph in which nodes correspond to 
the clusters identified by Seurat and edges rep- 
resent putative transitions between clusters. 
The differentiation tree is then computed as 
the treelike subgraph in the abstracted graph 
that best explains all continuous progressions along 
the original single-cell graph (supplementary 
note 3). 
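
The idea of abstracting a single-cell neighborhood graph into a cluster-level graph can be sketched as follows. This is a deliberately simplified stand-in, not the PAGA implementation: the connectivity score (observed inter-cluster edges divided by the number expected under random wiring with the same cluster degrees) and the use of a maximum spanning tree as the "treelike subgraph" are assumptions made for illustration, and the toy graph is hypothetical.

```python
import numpy as np

def cluster_connectivity(edges, labels, n_clusters):
    """Ratio of observed to expected inter-cluster edges, capped at 1."""
    labels = np.asarray(labels)
    observed = np.zeros((n_clusters, n_clusters))
    degree = np.zeros(n_clusters)
    for i, j in edges:
        a, b = labels[i], labels[j]
        degree[a] += 1
        degree[b] += 1
        if a != b:
            observed[a, b] += 1
            observed[b, a] += 1
    # Expected edge count between two clusters if edge ends were paired at random.
    expected = np.outer(degree, degree) / (2 * len(edges))
    conn = np.divide(observed, expected, out=np.zeros_like(observed), where=expected > 0)
    np.fill_diagonal(conn, 0.0)
    return np.minimum(conn, 1.0)

def maximum_spanning_tree(conn):
    """Kruskal's algorithm on the abstracted graph, keeping the strongest links."""
    n = conn.shape[0]
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    candidates = sorted(
        ((conn[a, b], a, b) for a in range(n) for b in range(a + 1, n) if conn[a, b] > 0),
        reverse=True,
    )
    tree = []
    for _, a, b in candidates:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b
            tree.append((a, b))
    return tree

# Hypothetical toy graph: 9 cells in 3 clusters, with cluster 1 bridging 0 and 2.
labels = [0, 0, 0, 1, 1, 1, 2, 2, 2]
edges = [
    (0, 1), (1, 2), (0, 2),   # within cluster 0
    (3, 4), (4, 5), (3, 5),   # within cluster 1
    (6, 7), (7, 8), (6, 8),   # within cluster 2
    (2, 3), (1, 3),           # cluster 0 to cluster 1
    (5, 6),                   # cluster 1 to cluster 2
]

conn = cluster_connectivity(edges, labels, n_clusters=3)
tree = maximum_spanning_tree(conn)
print(np.round(conn, 3))
print(tree)
```

In this toy graph the recovered tree links cluster 0 to cluster 1 and cluster 1 to cluster 2, and no direct link is drawn between clusters 0 and 2 because no cells of those clusters are neighbors.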

When running this algorithm, without any 
assumptions about the tree structure, we ob- 
tained an abstracted graph that shows high 
confidence of the branching events (Fig. 3A) 
from which we can derive a single differentia- 
tion tree that included all the cell types and 
linked them to a single root, the neoblast 1 cluster. 
This tree defines independent differentiation 
branches for all the major tissues such as neu- 
rons, muscle, parenchyma, and gut (Fig. 3A). 
Additionally, the tree reflects the relation between
different groups of cells. For example, it predicts
the existence of independent progenitor cells for 
the epidermis dorsoventral boundary and the 
pharynx cell type lineages, although both lin- 
eages are related to the epidermal lineage. By con- 
trast, it shows the presence of a shared progenitor 
for all parenchymal lineages despite containing cell 
types as different as glia and pigment cells. The 
connections in the tree are highly consistent with 
the continuity of gene expression patterns along 
the various lineages (fig. S8A) except for two 
cases: The epidermis cluster itself is disconnected 
from the epidermal lineage, and muscle pharynx is
connected to muscle body instead of muscle pro- 
genitors (fig. S8A). Together, from 51 clusters 
(with 1275 possible transitions between them), 
PAGA predicts 53 transitions that are mainly 
consistent with our marker-based analysis. 
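
The figure of 1275 possible transitions follows from counting unordered pairs of the 51 clusters. A short check, with variable names chosen here for illustration:

```python
from math import comb

n_clusters = 51
# Every unordered pair of distinct clusters is a candidate transition.
possible_transitions = comb(n_clusters, 2)   # equals 51 * 50 / 2

predicted_transitions = 53                   # number reported in the text
fraction_retained = predicted_transitions / possible_transitions

print(possible_transitions)
print(f"{fraction_retained:.1%} of candidate transitions retained")
```

Only about 4% of the candidate cluster pairs are therefore retained as predicted transitions.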

Furthermore, PAGA yields a pseudotemporal 
ordering of individual cells within each cluster 
consistent with our stem cell ablation and puri- 
fication experiments and therefore confidently 
predicts their differentiation status, even for cell 
types for which separate progenitor clusters could 
not be identified (fig. S8B). For instance, when 
we sorted the goblet cells by pseudotime, we 
observed a higher percentage of X1 cells in early
pseudotime and h2b(RNAi) cells in the late
pseudotime (fig. S8B). To validate this observa- 
tion, we performed double FISH (fluorescence 
in situ hybridization) of bruixot, our newly iden- 
tified goblet cell marker, and adb (aprenent de 
bruixot), a gene expressed earlier in the goblet 
cluster pseudotime (fig. S8B). Consistently, adb 
was expressed in the gut (fig. S8C) overlapping 
with bruixot, but staining more cells located in
the periphery of the gut that clearly lacked goblet 
cell morphology (fig. S8D). This indicates that 
adb is a marker of immature goblet cells and that
computationally estimated pseudotime correctly 
orders cells according to their differentiation 
status. 

Although the tree predicts the connectivity of 
cell clusters, it does not give any information 
about the direction of the trajectories. Thus, we 
used the tree topology to estimate the develop- 
mental potency of each cluster, i.e., their ability 
to give rise to other cells. We developed a po- 
tency score that is conceptually similar to the 
stemID score previously proposed to identify stem 
cells (16) but additionally estimates pluripotency 
versus multi- or unipotency of cell populations. It 
is computed as the normalized degree of each 
cluster in the abstracted graph (supplementary 
note 4). This analysis showed that neoblast 1, the 
largest stem cell cluster, had a score of 1 (Fig. 3B), 
correctly assigning pluripotency to neoblasts as 
expected from earlier studies (27). We note that 
the potency score is independent of prior 
information and therefore can be used to iden- 
tify stem cells from single-cell transcriptomics 
data alone, a feature that is particularly useful in 
less well-studied non-model organisms. Progeni- 
tor clusters showed lower potency than neoblasts 
and higher potency scores than differentiated 
cells (Fig. 3B), in agreement with a gradual po-
tency loss. To assess the stem cell and progenitor
status of the clusters connected in the center of
the PAGA topology, we mapped X1 and h2b(RNAi)
data onto the tree. Most X1 cells were located in the
neoblast 1 cluster (Fig. 3C), whereas h2b(RNAi)-
resistant cells were more enriched in the leaves
of the tree (Fig. 3D). Thus, both PAGA and stem 
cell ablation and purification independently sup- 
port the stem and progenitor status of these 
clusters. 
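
The potency score described above can be sketched as a degree computation on the tree. The text states only that the score is the normalized degree of each cluster; dividing by the maximum degree is an assumption made here (it is consistent with the most connected cluster scoring 1), and the exact normalization is given in the paper's supplementary note 4, which is not reproduced. The toy tree below is hypothetical.

```python
def potency_scores(tree_edges):
    """Degree of each cluster in the tree, divided by the largest degree.

    Normalizing by the maximum degree is an assumption for illustration.
    """
    degree = {}
    for u, v in tree_edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    top = max(degree.values())
    return {cluster: d / top for cluster, d in degree.items()}

# Hypothetical toy tree: one stem cell cluster, three progenitors, three leaves.
toy_edges = [
    ("neoblast 1", "epidermal progenitors"),
    ("neoblast 1", "neural progenitors"),
    ("neoblast 1", "muscle progenitors"),
    ("epidermal progenitors", "epidermis"),
    ("neural progenitors", "neurons"),
    ("muscle progenitors", "muscle body"),
]
scores = potency_scores(toy_edges)
```

In this toy tree the stem cell cluster scores 1, progenitors score 2/3, and differentiated leaves score 1/3, mirroring the gradual potency loss described in the text.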

The remaining neoblast clusters had lower po- 
tency scores than the neoblast cluster 1 and were 
connected to it. These clusters share the majority 
of marker genes with the neoblast 1 cluster (table 
S3) and do not correspond to previously identified
specialized neoblasts of the sigma, gamma, and 
zeta class (26, 42, 43) (fig. S9). Although some of 
these neoblast clusters are connected to differ- 
entiated cell types (Fig. 3A), most do not give rise 
to differentiated cell types, raising the possibility 
that they represent neoblasts in different meta- 
bolic, cell cycle, or activation states (supplemen- 
tary note 5). 

We detect expression of specialized neoblast 
markers among both neoblast and progenitor 
clusters (figs. S10 and S11). Although present in 
neoblasts, sigma markers were most highly ex-
pressed in neural and muscle progenitors, gamma
markers in gut and parenchymal progenitors, and
zeta markers in epidermal progenitors (fig. S9D).
These clusters are mostly devoid of X1 cells (Fig. 
3C) and therefore correspond mainly to postmi- 
totic progenitors. 


RNA velocity confirms lineage 
relationships predicted by PAGA 


To independently validate the differentiation tra- 
jectories predicted by PAGA, we used velocyto 
(44). This method computes RNA velocity, de- 
fined as the rate of change of mRNA levels for a 
gene in time, in every single cell. In differen- 
tiating cells in which changes in gene expres- 
sion are dominated by changes in transcription 
rates, the ratio of unspliced to spliced reads for 
a given gene within a cell will be proportional 
to the temporal change of the logarithm of 
spliced reads (or mature mRNAs) (44). Thus, one 
can estimate the future mRNA level of a gene by 
computing its velocity and a linear fit. By aggre- 
gating over many genes in a cell, one can esti- 
mate the cellular expression state to which the 
cell is apparently moving in time. We estimated 
mRNA velocities for each cell and projected the 
estimated future cell states onto the tSNE, which 
describe the paths predicted by the mRNA ve- 
locity model (Fig. 3E and fig. S12A). These paths 
show a highly homogeneous stem cell popula- 
tion that moves slowly to progenitors, which will 
differentiate to mature cell types. The long ar- 
rows at the edges of the clusters likely are due to 
the averaging on the force field, as they do not 
appear when individual arrows are plotted (fig. 
S12A). These paths largely agree with the trajec-
tories predicted by PAGA and also confirmed
the connection between muscle progenitors and 
pharynx muscle predicted from gene expression 
changes (fig. S12A). Additionally, velocyto can 
also model longer cell trajectories to identify 
their root (Fig. 3F) and terminal end points (Fig.
3G), which corresponded to the tSNE regions 
containing stem cells and terminally differenti- 
ated cells, respectively. Velocyto cannot provide 
information from disconnected clusters. As a re-
sult, all disconnected clusters contain differenti-
ation trajectories with independent start and
end points (Fig. 3, F and G). 
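
The velocity idea described above can be sketched with the simplest splicing model, in which spliced mRNA s changes as ds/dt = u - gamma * s for unspliced level u, so that u/s - gamma equals the rate of change of ln(s). The code below is a minimal sketch under that assumption, not the velocyto implementation: it fits gamma by least squares through the origin over all cells, whereas the published method uses a more careful fit. All names and numbers are hypothetical.

```python
def fit_gamma(unspliced, spliced):
    """Least-squares slope through the origin for u ~ gamma * s (steady state)."""
    numerator = sum(u * s for u, s in zip(unspliced, spliced))
    denominator = sum(s * s for s in spliced)
    return numerator / denominator

def velocity(u, s, gamma):
    """ds/dt under the simple model: positive if induced, negative if repressed."""
    return u - gamma * s

def extrapolate(s, v, dt=1.0):
    """Linear estimate of the future spliced level (clipped at zero)."""
    return max(s + v * dt, 0.0)

# Hypothetical steady-state cells for one gene.
spliced = [1.0, 2.0, 4.0]
unspliced = [0.5, 1.0, 2.0]
gamma = fit_gamma(unspliced, spliced)

induced = velocity(3.0, 4.0, gamma)      # more unspliced than expected at steady state
repressed = velocity(1.0, 4.0, gamma)    # less unspliced than expected
future_level = extrapolate(4.0, induced, dt=0.5)
```

Aggregating such per-gene extrapolations across many genes gives the estimated future expression state of a cell.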

The estimates of RNA dynamics obtained with 
velocyto also identified regions where genes are 
mainly induced or repressed compared to the 
steady-state level. This information can be help-
ful to investigate relations between clusters that
appear disconnected on the tSNE. We used these 
estimates to study the expression of marker 
genes from the epidermis cluster. These genes 
are clearly induced in epidermal progenitors and 
become repressed in mature epidermis, where
they are highly expressed (fig. S12B). Thus, mRNA
metabolism patterns provide additional sup-
port to the differentiation trajectory connect-
ing late epidermal progenitors to epidermis that
we predicted on the basis of gene expression
changes (fig. S8A).

Fig. 3. Lineage tree reconstruction by PAGA and velocyto. (A) Abstracted
graph showing all the possible edges with a probability higher than
10°© connecting two clusters and their confidence. Each node corresponds
to each of the clusters identified with Seurat. The size of nodes is
proportional to the amount of cells in the cluster. The most probable path
(C) or h2b(RNAi)-resistant (D) cells in each cluster. X1 cells are most
abundant in the neoblast 1 cluster, whereas h2b(RNAi)-resistant cells are
mostly located in the leaves of the tree. (E) Velocyto force field showing the
average differentiation trajectories (velocity) for cells located in different parts of
the tSNE plot. (F and G) Root (F) and terminal end points (G) obtained after
modeling the transition probabilities derived from the RNA velocity by using a
Markov process. The color scale represents the density of the end points of
the Markov process and ranges from yellow (low) to blue (high).
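
The root and terminal end points shown in Fig. 3, F and G, come from treating velocity-derived transition probabilities as a Markov process. The sketch below is a hedged illustration of that idea only: it propagates a uniform starting distribution forward to find where cells end up, and along a reversed chain to find where they come from. The row-renormalized transpose used for the reversed chain and the three-state transition matrix are assumptions for illustration.

```python
def step(dist, P):
    """One Markov step: redistribute probability mass according to P."""
    n = len(P)
    return [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]

def end_point_density(P, steps=500):
    """Propagate a uniform distribution for a fixed number of steps."""
    n = len(P)
    dist = [1.0 / n] * n
    for _ in range(steps):
        dist = step(dist, P)
    return dist

def reversed_chain(P):
    """Transpose P and renormalize rows (assumed way to run the process backward)."""
    n = len(P)
    out = []
    for i in range(n):
        row = [P[j][i] for j in range(n)]
        total = sum(row)
        if total == 0:
            # A state with no incoming transitions keeps its own mass.
            row = [1.0 if j == i else 0.0 for j in range(n)]
        else:
            row = [x / total for x in row]
        out.append(row)
    return out

# Hypothetical states: 0 = stem cell, 1 = progenitor, 2 = differentiated cell.
P = [
    [0.9, 0.1, 0.0],
    [0.0, 0.8, 0.2],
    [0.0, 0.0, 1.0],
]
terminal = end_point_density(P)
roots = end_point_density(reversed_chain(P))
```

In this toy chain the forward process concentrates on the differentiated state and the reversed process on the stem cell state.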


A consolidated lineage tree of planarian 
stem cell differentiation into all major 
cell types 


Taken together, our results show that both com- 
putational and experimental methods agree in 
the identification of stem cells, progenitors, and 
differentiated cells. By combining all four inde- 
pendent lines of evidence (PAGA, gene expres- 
sion changes, stem cell ablation and enrichment 
experiments, and velocyto), we provide a single 
consolidated tree that models stem cell differ- 
entiation trajectories into all identified cell types 
of adult planarians (Fig. 4A). The resulting cell 
lineage tree correctly recapitulates the known 
expression changes described during epider- 
mal differentiation (26, 34, 45) (Fig. 4B). We 
observed a continuous decrease of the expres- 
sion of Smedwi-1, a well-characterized neoblast 
marker (fig. S3), with pseudotime progression,
whereas early (prog-1) and late (agat-1) epidermal
progenitor, as well as mature epidermis (vim-1)
markers, increased their expression at consecutive 
time points (Fig. 4B). 

According to the consolidated lineage tree, 
neoblasts (35% of our wild-type cells) differenti- 
ate into at least 23 independent cell lineages. 
There are six major differentiation fates (57% of 
cells) (table S2), each representing more than 1%
of total cells: epidermal, parenchymal, neural, 
muscle, gut, and a pharynx cell type. For these 
major fates, we identified progenitor and differ- 
entiated states. Additionally, we identified 10 mi- 
nor lineages (6% of cells; each less abundant 
than 1% of total cells) that differentiate from 
the neoblasts, but for which we were unable to 
identify progenitors. 


Self-organizing maps identify gene 
programs underlying cell differentiation 


We used our data to identify gene sets that co- 
ordinately change their expression during differ- 
entiation. For this analysis, we discarded all cells 
from neoblast clusters that did not give rise to 
differentiated cell types in our consolidated cell 
lineage tree. The remaining cells were ordered 
following the tree for each lineage and sorted 
within each cluster according to their pseudo- 
temporal ordering (Fig. 5A). Subsequently, we 
used self-organizing maps (SOMs) (46) to identify 
48 sets of highly variable genes that coordinately 
change their expression during differentiation 
(47) (Fig. 5B, fig. S13, and table S4). Many of these 
sets contain some genes previously known to be 
expressed in the respective lineages and in some 
cases involved in their differentiation (table S5). 
For instance, gene sets 10 and 11 contain genes 
that are highly expressed in neoblast and pro-
genitor clusters, such as Smedwi-1 and tub-α1,
whose expression drops during differentiation 
(Fig. 5B and fig. S13). Similarly, we found gene 
sets that are regulated along muscle, neuronal,
parenchymal, gut, and epidermal differentia-
tion (Fig. 5B, top row). They contain genes
expressed in these lineages, such as mhc for the
muscle and chat for the neuronal lineage, but
also include well-known regulators of their dif-
ferentiation, such as myoD (48) and coe (49). As
a consequence of analyzing all detected pla-
narian cell lineages simultaneously, we not only
identified gene sets involved in lineage-specific
programs but also gene sets co-regulated during
the differentiation of several fates (Fig. 5B, mid-
dle and bottom row). Taken together, these re-
sults show that single-cell transcriptomics of a
whole organism allows the reconstruction of
specific differentiation events for many differ-
entiation fates in parallel, enabling the identi-
fication of previously undetected combinations
of co-regulated genes.

Fig. 4. Consolidated lineage tree of planarian stem cell differentiation into all major types.
(A) Consolidated lineage tree including four independent sources of evidence. The topology of the tree
is shown according to PAGA, and marker-based connections are shown with red edges. Velocyto-
supported connections are shown with thick edges. Progenitor and differentiated cell clusters according
to neoblast ablation and enrichment experiments are shown with yellow and blue halos, respectively.
(B) Gene expression changes of marker genes for the individual stages during epidermal differentiation
(in pseudotime). Relative expression of marker genes from neoblast (Smedwi-1), early (prog-1) and
late (agat-1) progenitors, as well as from the epidermis (vim-1). A maximum of 1000 cells from neoblast
1, epidermal neoblasts (en), early epidermal progenitors (eep), late epidermal progenitors 1 (lep 1)
and 2 (lep 2), and epidermis were sampled. Gray thin dashed lines show the expression of Smedwi-1 after
randomly permuting cells (rand1) or after randomly sorting cells within each cluster (rand2).
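
To illustrate how a self-organizing map can group genes by the shape of their expression along pseudotime, the sketch below trains a very small one-dimensional SOM on idealized profiles. It is a toy under stated assumptions, not the analysis used in the paper: the map size, learning schedule, deterministic initialization, and the four hypothetical genes with identical z-scored profiles are all chosen for illustration.

```python
import math

def nearest_unit(units, profile):
    """Index of the map unit closest (squared Euclidean) to a gene profile."""
    dists = [sum((u - p) ** 2 for u, p in zip(unit, profile)) for unit in units]
    return dists.index(min(dists))

def train_som(profiles, n_units=2, epochs=30):
    """Train a 1-D self-organizing map on per-gene expression profiles."""
    # Deterministic start: copy evenly spaced profiles as the initial units.
    stride = len(profiles) // n_units
    units = [list(profiles[k * stride]) for k in range(n_units)]
    for epoch in range(epochs):
        frac = epoch / epochs
        rate = 0.5 * (1.0 - frac)            # learning rate decays to zero
        radius = max(1.0 - frac, 0.1)        # neighborhood shrinks over time
        for profile in profiles:
            best = nearest_unit(units, profile)
            for k, unit in enumerate(units):
                # Units near the best-matching unit are pulled along with it.
                pull = rate * math.exp(-((k - best) ** 2) / (2 * radius ** 2))
                for d, value in enumerate(profile):
                    unit[d] += pull * (value - unit[d])
    return units

# Hypothetical genes: expression over three pseudotime bins, already z-scored.
genes = {
    "up_a": (-1.0, 0.0, 1.0),
    "up_b": (-1.0, 0.0, 1.0),
    "down_a": (1.0, 0.0, -1.0),
    "down_b": (1.0, 0.0, -1.0),
}
units = train_som(list(genes.values()))
assignment = {name: nearest_unit(units, profile) for name, profile in genes.items()}
```

Genes assigned to the same map unit form one gene set; here the induced and repressed genes end up in separate units.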


Molecular profiling of 
planarian regeneration by 
single-cell transcriptomics 


Freshwater planarians are well known for their 
remarkable regenerative capacities. Planarians can 
be cut into small pieces, and each piece (except 
for the pharynx and the most anterior tip of the 
head, which are devoid of neoblasts) can regen- 
erate a complete, albeit much smaller, organism 
in a matter of days. This process is dynamically 
complex and involves the orchestration of all cel- 
lular differentiation pathways. The animal does 
not grow (as it cannot eat) during the process. 
Thus, the truncated body fragments need to 
reshape their body proportions to adjust to their
new size by a process termed morphallaxis (50). It 
is still largely unknown how each individual cell 
type behaves in this process. 

Given the detected cell type abundances and 
the cell differentiation tree of steady-state adult 
animals, we asked if we could use Drop-seq to 
profile the cellular and transcriptomic changes 
that occur during regeneration. We cut planar- 
ians into five to seven pieces, discarded the head 
piece, and prepared the remaining body pieces 
for single-cell transcriptomics immediately after 
cutting (day 0), and 2 and 4 days after cutting 
(Fig. 6A and fig. S14). We compared regenerating 
samples to those at day 0 using Seurat and de- 
tected hundreds of differentially expressed genes 
in both samples (tables S6 and S7). By pooling all
cells, we could detect up-regulation after 2 days 
of regeneration of 16 of the 128 wound-induced 
genes described in a previous study (42, 51) (table
S8 and fig. S15A). The shallowness of Drop-seq 
data makes it difficult to assess differences in 
lowly expressed genes. However, Drop-seq allows 
the cell types that undergo these changes to be
distinguished, showing that runt-1 and egr-2 are 
up-regulated in the neoblast 1 cluster (fig. S15B) 
and jun-1 in the muscle body cluster (fig. S15C
and tables S6 and S7). 

All cells from the regenerating samples fall 
into clusters that are present in wild-type sam- 
ples, indicating an absence of regeneration- 
specific types or trajectories (table S2). However, 
our analysis revealed notable changes in cell 
composition during regeneration (Fig. 6B and 
fig. S16). On one hand, we observed a large in- 
crease in the number of neoblasts, consistent 
with an increase in mitotic activity, and of neu- 
ral progenitors, reflecting active neurogenesis 
to replace missing brain structures after head 
removal. On the other hand, we detected that 
both parenchymal progenitor cells and differ- 
entiated parenchymal cell types were depleted 
(Fig. 6B), indicating that these cells are cleared in 
the process of reshaping the planarian tissue. 
The cell proportion changes at days 2 and 4 
were clearly correlated (Fig. 6C), indicating that 
aqp-positive parenchymal cells are the most de-
pleted cell type. We experimentally confirmed this
observation by in situ hybridization on planarian
tissue sections (Fig. 6D) and counting aqp-positive
parenchymal cells (Fig. 6E) (Mann-Whitney U test
p-value < 10~’). Our results indicate that parenchy-
mal cells are highly depleted upon regeneration,
implying that they may be used to metabolically
fuel the regeneration process (52).

Fig. 5. Identification of gene sets regulated and co-regulated in cell differentiation. (A) Schematic
workflow of the analysis performed to identify gene sets involved in the differentiation processes.
Pseudotemporal ordering of the cells from all lineages and clustering of variable genes using SOMs
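
The cell-composition comparisons discussed above rest on 2x2 contingency tables (cells inside versus outside a cluster, in a regenerating sample versus day 0). The sketch below shows a plain two-sided Fisher's exact test and a log2 odds ratio for such a table. It is illustrative only: the counts in the example are hypothetical, and the multiple-testing adjustment mentioned for the published p-values is not covered.

```python
from math import comb, log2

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables (with the same
    margins) that are no more likely than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)  # tolerance for floating-point ties
    )

def log2_odds_ratio(a, b, c, d):
    """log2 of (a*d)/(b*c); positive means enrichment in the first row."""
    return log2((a * d) / (b * c))

# Hypothetical counts: rows = (day 2, day 0), columns = (in cluster, not in cluster).
p_value = fisher_exact_two_sided(3, 1, 1, 3)
effect = log2_odds_ratio(2, 1, 1, 2)
```

For the small table above the two-sided p-value is 34/70, and the second call gives a log2 odds ratio of 2.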


Discussion 


In this study, we made use of the stem cell pop- 
ulation and the extreme regeneration capabil- 
ities of adult flatworms to generate an atlas of 
cell types at high resolution. We identified, quan- 
tified, and molecularly characterized 37 cell types, 
including 23 terminally differentiated cell types, 
and numerous progenitor and stem cell clusters. 
Although our sequencing data are relatively shal- 
low, molecular characterization of cell types with 
computational methods was robust, agreed well 
with previously published microscopy data, and 
revealed progenitor and differentiated cells. This 
implies that the grouping of incomplete tran- 
scriptomes of thousands of cells into clusters did 
not suffer appreciably from capture rates or other 
confounding factors. 

The resolution of our data depends on both 
the number of cells sequenced and the number 
of genes detected per cell. Considering only wild- 
type and control samples (~11,000 cells), we can 
identify differentiated cell clusters containing 
about 10 cells. Therefore, we estimate that cells 
present at a frequency of <1:1000, such as cintillo+ 
cells (53) and photoreceptor neurons (33), will 
be missed by our approach. Furthermore, we 
failed to identify certain neoblast subpopulations 
previously described in the literature (26, 42). This 
result could be due to the low sensitivity of Drop- 
seq, which captures only a fraction of mRNAs in 
a cell. However, we do detect the expression of 
the proposed marker genes of these subpopula- 
tions. They appear to be spread among neoblasts 
and progenitor clusters (fig. S9), which still ex-
press neoblast markers such as Smedwi-1 at low
levels (figs. S10 and S11). This result indicates that 
the boundary between stem cells and lineage- 
committed progenitors is probably not sharp. 
Further studies will help in describing and de- 
limiting these boundaries. 
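
The detection limit estimated above can be checked with a short calculation. Assuming cells are sampled independently (an assumption made here for illustration), the number of cells of a given type among about 11,000 sequenced cells is binomial, and a type is recoverable only if roughly 10 of its cells are captured.

```python
from math import comb

def prob_at_least(k, n, freq):
    """P(X >= k) for X ~ Binomial(n, freq)."""
    below = sum(comb(n, i) * freq ** i * (1 - freq) ** (n - i) for i in range(k))
    return 1.0 - below

n_cells = 11_000          # wild-type and control cells, as stated in the text
min_cluster_size = 10     # approximate smallest cluster identified

expected_at_1_in_1000 = n_cells / 1000
p_detect_1_in_1000 = prob_at_least(min_cluster_size, n_cells, 1 / 1000)
p_detect_1_in_2000 = prob_at_least(min_cluster_size, n_cells, 1 / 2000)
```

At a frequency of 1:1000 about 11 cells are expected, so such a type sits right at the threshold; at half that frequency, capturing 10 cells becomes unlikely.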

Projecting high-dimensional gene expression 
data of thousands of transcriptomes onto a two- 
dimensional plot [for example, by the widely 
used tSNE method (54)] visually reveals clusters. 
However, it is impossible to infer the relationships 
among them, as the distances between clusters 
cannot be interpreted as differentiation trajecto- 
ries. To solve this problem, we used computa- 
tional and experimental methods to reconstruct 
a lineage tree. PAGA and velocyto provide two 
complementary approaches to study cell differ- 
entiation by using single-cell transcriptomics. 
Whereas velocyto allows the differentiation tra- 
jectories (including their directionality) of indi- 
vidual cells to be located within a cell continuum 
on the basis of mRNA metabolism, PAGA allows 
the average differentiation paths of a group of 
cells to be inferred, even when they are discon-
nected. Thus, combining these two computational
methods results in a robust lineage prediction.
This prediction is supported by the continuity
of expression of marker genes and the mapping
of stem cells and differentiated cells on the tree
(Fig. 4A), and validates known differentiation
trajectories such as that of the epidermal lineage
(Fig. 4B) (34).

Fig. 6. Molecular profiling of regeneration by single-cell transcriptomics.
(A) Experimental workflow: Planarians were cut into small pieces; head
pieces were discarded and the remaining pieces were processed for single-
cell RNA sequencing 0, 2, and 4 days after cutting. (B) Quantification of
neoblasts, neural progenitors, and differentiated clusters and parenchymal
progenitors and differentiated clusters. Significant differences calculated
using Fisher’s exact test with an adjusted p-value < 0.001 are marked

We used PAGA (22) to reconstruct in an un- 
biased way the lineage tree of all major planarian 
cells. This method, although indirect, allows the 
lineage information to be reconstructed from the 
transcriptomic snapshot of individual cells. Addi- 
tionally, in contrast to high-throughput lineage- 
tracing methods (9-12), which rely on the use of 
transgenic or CRISPR-Cas tools, it can be applied 
to every species provided that single cells can be 
isolated and sequenced. Using this method, we 
identified de novo planarian stem cells and pre- 
dict their differentiation paths to at least 23 dif- 
ferent lineages, including several multipotent 
progenitor populations. Notably, these tools can 
readily be applied to other organisms to identify 
de novo stem cells, identify their differentiation 
trajectories, and estimate the developmental po- 
tency of the resulting cell populations. 

Pseudotemporal ordering of cells along these 
lineages allowed us to discover gene sets that 
are putatively involved in differentiation pro- 
grams, highlighting the similarities and differ-
ences that exist across tissues and identifying
several genes known to be involved in cell dif- 
ferentiation not only in planarians but also in 
other species, such as myoD, nkx6, and pax6. 
Further characterization of these gene sets 
should be the subject of future studies. As we 
show for the h2b RNAi phenotype, Drop-seq 
also allows profiling of perturbation studies at 
both the transcriptomic and cellular levels. In 
general, and beyond planaria, we can foresee that 
future studies will use single-cell transcriptomics 
coupled to loss-of-function experiments to un- 
ravel the specific developmental functions of 
genes or regulatory networks. 

Furthermore, we used single-cell transcripto- 
mics to profile cellular abundance changes upon 
regeneration. Our experiments revealed that sev- 
eral of our newly described parenchymal cell 
types are depleted in regeneration. These cell 
types had been largely overlooked in molecular 
studies but had been described, on the basis of 
microscopy, in the literature decades ago. This 
is in part due to the unbiased nature of both 
microscopic and single-cell transcriptomic data. 
These parenchymal cells are highly enriched in 
lysosomes and other vacuoles and might be an 
energy reservoir that regenerating planarians 
mobilize to fuel regeneration. 

To make our data easily accessible, we built an 
interactive app that allows users to query and 
interpret all sequencing data (https://shiny.mdc-berlin.de/psca).
We also provide a detailed tutorial for the lineage
reconstruction algorithm PAGA that we hope will
serve as a reference for future studies
(https://github.com/rajewsky-lab/planarian_lineages).
Together, our results show
that single-cell expression profiling can be used
to systematically annotate cell types of entire
animals, to reconstruct stem cell differentiation
lineages and identify gene sets that likely contain
genes involved in programming them, and to study
complex processes such as regeneration (and their
relation to lineages used in normal development)
at single-cell resolution. Our results and methods
demonstrate that single-cell approaches will be-
come an indispensable method to study develop-
mental and regeneration biology.

with ** (C) Cluster outlines colored according to the log2 (odds ratio) of


Methods summary 


Single-cell transcriptomic profiling of asexual adult 
planarians from the species S. mediterranea was 
performed with Drop-seq. Single-cell suspen- 
sions were prepared by dissociating cells from 
adult planarians with trypsin. We used FACS to 
discard broken and dead cells. Cells were either 
directly processed for Drop-seq or preserved in 
methanol or DMSO for later processing. For RNAi 
experiments, animals were injected with double- 
stranded RNA against the coding region of h2b 
or gfp for three consecutive days and kept at 
20°C; their cells were prepared for FACS and 
single-cell transcriptomics 5 days after the third 
injection. For regeneration experiments, animals
ranging from 4 to 10 mm in size were cut into 
five to seven pieces; the head pieces were dis- 
carded, and the remaining pieces were processed 
for Drop-seq immediately, or 2 or 4 days after 
cutting. 

Computational analysis of the sequenced sam- 
ples was done with Drop-seq tools and the Seurat 
package (3). Briefly, reads were mapped to the 
S. mediterranea dd_Smed_v6 transcriptome and 
processed with Drop-seq tools and custom perl 
scripts to generate Digital Gene Expression (DGE) 
matrices for each sample. Finally, all DGE matrices 
were joined. Variable genes across all clusters were 
used to perform a PCA. The first 50 principal com- 
ponents obtained were then tested for statistical 
significance, and those with a p-value < 10° 
were used to perform clustering. The robustness 
of the obtained clusters was assessed, and spu- 
rious clusters were merged to obtain a final set 
of 51 clusters. 
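
The joining of per-sample DGE matrices mentioned above can be pictured with the following sketch. The authors used Drop-seq tools and custom perl scripts; this Python version is only an illustration of the idea, with a hypothetical nested-dictionary input format, hypothetical barcodes and counts, and cell identifiers prefixed by sample name so that barcodes shared between samples do not collide.

```python
def join_dge(samples):
    """Join per-sample DGE matrices into one gene-by-cell table.

    samples: {sample_name: {gene: {cell_barcode: umi_count}}}
    Returns (genes, cells, matrix), where matrix[g][c] is the count of
    genes[g] in cells[c]; entries missing from a sample become 0.
    """
    genes = sorted({gene for dge in samples.values() for gene in dge})
    cells = []
    for name in sorted(samples):
        barcodes = sorted({bc for counts in samples[name].values() for bc in counts})
        cells.extend((name, bc) for bc in barcodes)
    matrix = [
        [samples[name].get(gene, {}).get(bc, 0) for name, bc in cells]
        for gene in genes
    ]
    return genes, cells, matrix

# Hypothetical counts for two samples.
samples = {
    "day0": {"Smedwi-1": {"AAAC": 3, "TTTG": 1}, "vim-1": {"TTTG": 2}},
    "day2": {"Smedwi-1": {"AAAC": 5}, "runt-1": {"AAAC": 1}},
}
genes, cells, matrix = join_dge(samples)
```

Genes absent from one sample are filled with zeros, which keeps every sample on the same gene axis before variable-gene selection and PCA.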

Cell type identification was performed by cal- 
culating marker genes for each cluster. Manual 
inspection, comparison to previously published 
single-cell data, and experimental validation using 
in situ hybridizations of the marker genes re- 
ported allowed the identification of the different 
cell populations. Additional characterization of 
the identified cell types was performed by char- 
acterizing GO-term-based gene sets with PAGODA 
(40). Experimental validation of cell types was 
done by whole-mount in situ hybridization and 
in situ hybridization on histological sections as 
previously described and using probes comple- 
mentary to marker genes (table S9). 

Lineage reconstruction was done by com- 
bining the unsupervised graph obtained with 
the PAGA algorithm (22) with velocyto (44), gene 
expression analysis, and experimental data from 
h2b(RNAi) and X1 FACS-sorted cells. To calcu-
late RNA velocity with velocyto, we mapped the
reads from all data sets to the planarian genome 
to extract spliced and unspliced reads. These 
analyses allowed pseudotemporal ordering of 
cells that was used to identify gene sets that 
change during stem cell differentiation by using 
self-organizing maps. 

To perform cell counting of regenerating planar- 
ians, positive cells were automatically counted by 
using a custom script for ImageJ (https://imagej.net). 
Gut microbiome-mediated bile
acid metabolism regulates liver 
cancer via NKT cells 
INTRODUCTION: Primary liver tumors and
liver metastasis currently represent the lead- 
ing cause of cancer-related deaths. The liver 
intimately cross-talks with the gut and per- 
forms many essential functions related to 
digestion, metabolism of nutrients, and clear- 
ance of bacterial metabolites. Diseased livers 
are often associated with altered gut bacterial 
composition, or dysbiosis, and it has been sug- 
gested that gut bacterial products contribute 
to malignant transformation of hepatocytes. 
The liver is exposed to the gut microbiome 
through the portal vein and is an immuno- 
logical organ that is heavily populated by im- 
mune cells. Emerging studies have shown 
that gut commensal bacteria are important 
regulators of antitumor immunity. Although it 
has been established that the gut microbiome 


NKT cells control 
tumor growth. 
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influences the efficacy of cancer immuno- 
therapy, the role of gut bacteria in antitumor 
surveillance in the liver is poorly understood. 


RATIONALE: The liver is exposed to gut bac- 
terial metabolites and products by way of blood 
from the intestine, which comprises 70% of the 
whole liver blood supply. Changes in the gut 
microbiome may affect immune cell function in 
the liver, and commensal bacteria can mediate 
the metabolism of primary into secondary bile 
acids, which recirculate back into the liver 
through the enterohepatic circulation. Given 
that bile acids are known to be involved in 
liver cancer development, we focused on the 
role of bile acids in immunosurveillance of 
tumors growing in the liver. We altered gut 
bacteria and examined changes of hepatic im-
mune cells and antitumor immunity directed
against liver tumors. Uncovering how the gut
microbiome uses bile acids to shape immunity
to liver cancer may have future therapeutic
applications.

Gut microbiome modulates liver cancer through bile acid-regulated NKT cells. Gut
microbiome uses bile acids as a messenger to regulate chemokine CXCL16 level on liver sinusoidal
endothelial cells (LSEC) and thus controls the accumulation of CXCR6+ hepatic NKT cells. The
accumulated NKT cells have an activated phenotype and inhibit liver tumor growth.


RESULTS: Using one primary liver model and 
three liver metastasis models, we found that 
altering commensal gut bacteria induced a liver- 
selective antitumor effect. A selective increase of 
hepatic CXCR6+ natural killer T (NKT) cells was
observed, independent of mouse strain, gender, 
or presence of liver tumors. The accumulated 
hepatic NKT cells showed an activated pheno-
type and produced more interferon-γ upon antigen
stimulation. In vivo studies using both antibody-
mediated cell depletion and NKT-deficient mice
confirmed that NKT cells
mediated the inhibition of tumor growth in the 
liver. Further investigation showed that NKT 
cell accumulation was regulated by the expres-
sion of CXCL16, the sole ligand for CXCR6, on
liver sinusoidal endothelial cells, which form
the lining of liver capillaries and the first bar-
rier for the blood coming from the gut enter-
ing the liver. Primary bile acids increased CXCL16
expression, whereas secondary bile acids showed
the opposite effect. Removing gram-positive bac-
teria, which include the bacteria mediating primary-
to-secondary bile acid conversion, by antibiotic
treatment with vancomycin was sufficient
to induce hepatic NKT cell accumulation and
decrease liver tumor growth. Feeding secondary
bile acids or colonization with bile acid-metabolizing
bacteria reversed both NKT cell accumulation
and inhibition of liver tumor growth in mice
with altered gut commensal bacteria. In non-
tumor liver tissue from human patients with
primary liver cancer, primary bile acid cheno-
deoxycholic acid (CDCA) levels correlated with
CXCL16 expression, whereas an inverse corre-
lation was observed with secondary bile acid
glycolithocholate (GLCA), suggesting that the
finding may apply to humans.


CONCLUSION: We describe a mechanism by 
which the gut microbiome uses bile acids as 
messengers to control a chemokine-dependent 
accumulation of hepatic NKT cells and anti- 
tumor immunity in the liver, against both pri- 
mary and metastatic liver tumors. These findings 
not only have possible implications for future 
cancer therapeutic studies but also provide a 
link between the gut microbiome, its metab- 
olites, and immune responses in the liver. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: tim.greten@nih.gov 
Gut microbiome-mediated bile
acid metabolism regulates liver 
cancer via NKT cells 


Chi Ma, Miaojun Han, Bernd Heinrich, Qiong Fu, Qianfei Zhang, Milan Sandhu,
David Agdashian, Masaki Terabe, Jay A. Berzofsky, Valerie Fako, Thomas Ritz,
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Primary liver tumors and liver metastasis currently represent the leading cause of cancer- 
related death. Commensal bacteria are important regulators of antitumor immunity, and 
although the liver is exposed to gut bacteria, their role in antitumor surveillance of liver tumors is 
poorly understood. We found that altering commensal gut bacteria in mice induced a liver- 
selective antitumor effect, with an increase of hepatic CXCR6* natural killer T (NKT) cells 

and heightened interferon-y production upon antigen stimulation. In vivo functional studies 
showed that NKT cells mediated liver-selective tumor inhibition. NKT cell accumulation was 
regulated by CXCL16 expression of liver sinusoidal endothelial cells, which was controlled by gut 
microbiome-mediated primary-to-secondary bile acid conversion. Our study suggests a link 
between gut bacteria—controlled bile acid metabolism and liver antitumor immunosurveillance. 


The gut microbiome has emerged as a critical
factor regulating antitumor immunity and
controlling the efficacy of chemo- and immunotherapies
(1-6). It is noteworthy that
the liver is exposed to bacterial components
and metabolites through the portal vein, and pro- 
found effects of the gut microbiome on hepato- 
cellular carcinoma (HCC) have been described 
(7, 8). Secondary hepatic malignancies (liver 
metastases) account for 95% of all hepatic cancers, 
and the liver is the most common site for organ 
metastasis in the body (9). To evaluate how the 
gut microbiome shapes antitumor immunity in 
the liver, we investigated the effects of gut com- 
mensal bacteria on both primary HCC and liver 
metastasis in mouse models. 


Alterations in the gut microbiome 
suppress liver tumors in multiple 
mouse models 


Spontaneous HCC was induced using MYC trans- 
genic mice as described before (10). An antibiotic 
cocktail (ABX, consisting of vancomycin, neomycin, 
and primaxin) was added to drinking water to 
deplete gut commensal bacteria (3). The anti- 
bacterial efficacy of ABX was confirmed, and 
the cocktail was not toxic to the liver (fig. S1, A
to C). Consistent with previous findings, fewer
and smaller HCC were found in ABX-treated 
MYC mice (Fig. 1A and fig. S1, D and E). We ex- 
tended our studies to a subcutaneous implanta- 
tion model (EL4 thymoma) to study potential 
systemic effects. ABX treatment did not affect 
the growth of subcutaneous EL4 tumors in syngeneic
C57BL/6 mice (fig. S1F). In contrast, fewer
spontaneous liver metastases were seen in mice
with large subcutaneous EL4 tumors upon ABX
treatment (Fig. 1B). To confirm this liver-selective
antitumor effect, we used an intrasplenic tumor
injection model (11) and found a robust decrease
of B16 liver metastasis (Fig. 1C). Unlike in the 
liver, lung metastases were increased by ABX 
when the same B16 tumor cells were injected 
into the tail vein (fig. S1, G and H). Similar re- 
sults were observed in BALB/c mice using A20 
tumors (12) (Fig. 1D and fig. S1, I and J). These 
findings suggest that modulating gut commensal 
bacteria can specifically modify growth kinetics 
of intrahepatic tumors. 


Hepatic NKT cell accumulation precedes 
tumor inhibition 


To explore the mechanism behind tumor suppression,
we studied immune cell subsets in EL4
tumor-bearing mice on ABX treatment. A
prominent expansion of hepatic natural killer
T (NKT) cells and CD8+ T cells was found (Fig. 2A
and fig. S2A), whereas no changes were found
in other immune subsets (B cells, CD4+ T, NK,
γ/δ T cells, and G-MDSC). The accumulation of
hepatic NKT cells, but not CD8+ T cells, was also
observed in ABX-treated MYC mice bearing HCC 
(Fig. 2B and fig. S2B), and splenic NKT cells re- 
mained unchanged, suggesting a liver-specific 
effect (fig. S2C). 

To understand how the gut microbiome med- 
iates hepatic NKT cell accumulation, tumor-free 
mice were used. ABX-treated C57BL/6 or BALB/c 
mice had more absolute and relative hepatic NKT 
cells than untreated mice, which was not ob- 
served in the spleen and was independent of 
gender (Fig. 2, C to E, and fig. S2, D, E, G, and H). 

The chemokine receptor CXCR6 mediates NKT 
cell survival and accumulation in the liver (13), 
and we found that all hepatic NKT cells expressed 
CXCR6 (Fig. 2F) as expected. NKT cells make
up the majority of hepatic CXCR6+ cells (~75%)
(Fig. 2G), which did not change after ABX treatment
(Fig. 2H). ABX treatment caused a ~2-fold
increase of CXCR6+ cells in the liver of C57BL/6
and BALB/c mice (Fig. 2I). CXCR6 is also expressed
on T cells (14), and we observed that both
hepatic CXCR6+CD62LlowCD44hi effector memory
CD4+ and CD8+ T cells increased after ABX treatment
(fig. S2, J to L). The observation that most
NKT cells were effector memory CD44hiCD62Llow
cells (fig. S3A) prompted us to check activation
status. Almost all hepatic NKT cells were the
activated CD69+ subtype in both tumor-bearing
(fig. S3, B and C) and tumor-free mice (fig. S3D).
CD25 and 4-1BB, two additional activation markers,
did not change in hepatic NKT cells after
ABX treatment (fig. S3, E and F). The observed
higher CD69 expression, with no change in the
4-1BB level, suggests that the NKT cells in
ABX-treated mice are more active (Fig. 2K and
fig. S3, J and K). NKT cells can exert diverse functions
by rapidly releasing cytokines after activation.
Thus, cytokine expression was measured
in NKT cells after in vivo antigen-specific stimulation
by injecting mice with α-galactosylceramide
(αGalCer)-loaded tumor cells. Higher interferon-γ
(IFN-γ) was detected in hepatic NKT cells from
mice that received ABX treatment, whereas tumor
necrosis factor (TNF) levels did not change (Fig. 2J
and fig. S3, G to I). IFN-γ production by NKT cells
has been shown to be instrumental for NKT-initiated
tumor immunity. These data suggest
that depleting gut commensal bacteria endows
hepatic NKT cells with a stronger antitumor
function. No change was observed for in vivo
cytotoxicity of NKT cells after ABX treatment
(fig. S3, L and M).
Finally, we studied NKT subpopulations based
on the expression of transcription factors (15).
The majority of hepatic NKT cells were NKT1,
and the levels of NKT subsets did not change
(fig. S3N). Interestingly, PLZF, which is required
for NKT development (16), significantly decreased
in the liver after ABX treatment (fig. S3O), which
was not apparent in the spleen (fig. S3, P and Q).

Regulatory T cells (Treg cells) are important modulators
in tumor progression. The gut microbiome
has been reported to affect Treg cells (17); however, we
found no change in Foxp3+CD4+ subsets in either
the liver or spleen of ABX-treated C57BL/6 mice
(fig. S3, R and S). Taken together, our results
showed that altering the gut microbiome caused
accumulation of hepatic NKT and effector memory
CD4+ or CD8+ T cells. The activated status of
NKT cells leads to higher levels of IFN-γ upon
encounter with antigen-loaded tumor cells, thereby
fostering a tumor-rejecting environment.


Hepatic NKT cells mediate 
tumor inhibition 


Antibody-mediated cell depletion was performed 
to investigate the specific function of individual 
immune cell populations controlling liver tumor 
immunity in ABX-treated mice. ABX-pretreated 
C57BL/6 mice were given intrasplenic injection 
of B16 tumor cells. T cell depletion was performed 
1 day before tumor injection (Fig. 3A). Removing 
all three major hepatic T cell subsets (CD4+ T,
CD8+ T, and CD4+ NKT cells) completely reversed
the inhibition of liver metastasis caused by elimination
of gut commensal bacteria (Fig. 3, B to D,
and fig. S4, A to C), whereas depleting CD8+
T cells alone had only minor effects.

The antitumor activity of NKT cells can cause
direct killing of CD1d-expressing tumors (18-20),
and we noted that all three (B16, EL4, and A20)
of our tumor models tested expressed CD1d (fig.
S4D). In addition, NKT cells can recognize tumor
antigen from CD1d-negative tumors through cross-presentation
by professional antigen-presenting
cells (21). To investigate the role of NKT cells in
CD1d-expressing tumors, we used CD1d-knockout
mice (which completely lack NKT cells) (22) and
CXCR6-knockout mice (which have a selective
NKT deficiency in the liver) (13). We confirmed
loss of hepatic NKT cells in these mice (fig. S4E) 
and then induced liver tumors by intravenous 
injection of EL4 tumor cells (23). Depleting gut 
microbiome reduced EL4 liver tumor burden in 
wild-type mice, but no reduction in liver tumor size 
was found in either CD1d-knockout or CXCR6- 
knockout mice after ABX treatment (Fig. 3, E 
and F). Intrasplenic injection of B16 tumor 
cells was repeated in CD1d-knockout or CXCR6- 
knockout mice, and similar results were observed 
(fig. S4, F to H). These findings suggest that 
hepatic NKT cells are necessary for effects on 
tumor growth in the liver induced by changes 
in the gut microbiome. 


A bile acid/CXCL16/CXCR6 axis controls 
hepatic NKT accumulation 


Virtually all hepatic NKT cells express CXCR6 
(Fig. 2F), and ABX treatment increased the accumulation
of CXCR6+ cells in the liver (Fig. 2I),
whereas CXCR6 mean fluorescence intensity
did not change on NKT cells (fig. S5A). Therefore,
CXCL16, the only ligand for CXCR6 (24), was
further studied. As expected, higher Cxcl16
mRNA levels were found in the liver of ABX-treated
mice (Fig. 4A). This increase of Cxcl16
mRNA was not detected in the lung (fig. S5B).
Liver sinusoidal endothelial cells (LSECs) have
been reported to be the major source of CXCL16
production in the liver (13). To identify the source
of CXCL16, we isolated LSECs from ABX-treated
mice. Figure 4B shows that there was an almost
2-fold increase of Cxcl16 mRNA in LSECs from
ABX-treated mice. Consistent with the previous
report, LSECs have a much higher basal level of
Cxcl16 mRNA, and ABX treatment did not affect
Cxcl16 mRNA expression in non-LSECs. The increase
of CXCL16 protein in LSECs was confirmed
by immunohistochemistry staining (Fig. 4C). In
addition, forced CXCL16 expression in the liver
increased hepatic NKT levels (fig. S5C). Together,


Fig. 1. Depleting gut commensal bacteria inhibits liver tumor growth in multiple
mouse models. (A) MYC transgene was turned on at the age of 4 weeks. MYC-ON mice
were fed with ABX or regular water. Ten weeks later, mice were killed and liver surface
tumor nodules were counted. Representative liver images are shown. Data represent
mean ± SEM of two pooled experiments. n = 5 for H2O, 6 for ABX. P < 0.05, Student's
t test. (B) C57BL/6 mice were treated with ABX or H2O for 3 weeks before receiving
subcutaneous EL4 tumor cell injection. Four weeks later, liver metastases were
determined. Representative images of five pooled experiments are shown. n = 17
for H2O, 12 for ABX. P < 0.05, chi-square test. (C) ABX- or H2O-pretreated C57BL/6 mice
were given intrasplenic B16 tumor cell injection. One and a half weeks later, liver
metastases were measured. Representative images are shown. Data represent mean ±
SEM of five pooled experiments. n = 18 for H2O, 15 for ABX. P < 0.05, Student's t test.
(D) BALB/c mice were treated with ABX or H2O for 3 weeks. Then mice received A20 tumor
cell tail vein injection. Three weeks later, liver metastases were counted. Representative
images are shown. Data represent mean ± SEM of two
pooled experiments. n = 8, P < 0.05, Student's t test.

these results suggest that ABX treatment causes
LSECs to produce more CXCL16, which recruits
NKT cells to the liver.

CXCL16 has both cell-surface and secreted forms;
the cell-surface form has been identified as a
scavenger receptor for phosphatidylserine and
lipoprotein and is involved in lipid metabolism
(25). Gut commensal bacteria are well known
to mediate bile acid metabolism in the intestine,
and the gut microbiome reportedly regulates bile
acid composition (26). Therefore, we searched
for a possible link between bile acids and CXCL16.
First, we used cholestyramine, a bile acid sequestrant,
to block the enterohepatic circulation,
thus reducing bile acid levels in the liver (27).

Fig. 2. Hepatic NKT cells increase after depleting gut commensal
bacteria. (A) ABX- or H2O-pretreated C57BL/6 mice were given
subcutaneous EL4 tumor injection. Two and a half weeks later, liver-infiltrating
immune cells were measured. Data represent mean ± SEM of
three pooled experiments. n = 15 for EL4-H2O, 20 for EL4-ABX. P < 0.05,
two-way analysis of variance (ANOVA). (B) Hepatic NKT, CD4, and CD8
T cell levels of MYC mice described in Fig. 1A. Data represent mean ±
SEM of two pooled experiments. P < 0.05, two-way ANOVA. (C and
D) Hepatic NKT cells in tumor-free C57BL/6 mice fed with ABX or
H2O. Data represent mean ± SEM of two pooled experiments. n = 9 for
H2O, 10 for ABX. P < 0.05, Student's t test. (E) Hepatic NKT cells in tumor-free
BALB/c mice fed with ABX or H2O. Data represent mean ± SEM of
two pooled experiments. n = 5, P < 0.05, Student's t test. (F)
Representative CXCR6 staining in hepatic NKT cells from three
independent experiments. (G) Representative NKT cell staining in CXCR6+
liver-infiltrating mononuclear cells from three independent experiments.
(H) Composition of CXCR6+ liver-infiltrating mononuclear cells in tumor-free
C57BL/6 mice fed with ABX or H2O. DN T: double-negative T cells.
Data represent pooled results from three experiments. (I) Levels of
CXCR6+ liver-infiltrating cells in tumor-free C57BL/6 mice fed with ABX or
H2O. Data represent mean ± SEM of two pooled experiments. n = 9 for
H2O, 10 for ABX. P < 0.05, Student's t test. (J and K) IFN-γ and CD69 levels
of hepatic NKT cells after in vivo stimulation by injecting αGalCer-loaded
A20 tumor cells (Gal/A20) into vancomycin (Vanco)- or H2O-fed BALB/c
mice. Data represent mean ± SEM of two pooled experiments. n = 9, P <
0.05, Student's t test.

Cholestyramine treatment increased hepatic
NKT and CXCR6+ cells, but not CD4+ T or CD8+
T cells (Fig. 4D and fig. S5D). In parallel, Cxcl16
mRNA was up-regulated in the liver (fig. S5E).
This result suggests that bile acids are involved
in the accumulation of NKT cells in the liver
through CXCL16 regulation. To identify the bile
acids involved in NKT cell regulation, the liver
bile acid profile was determined. As reported
(26), control mouse liver contains the highest
amount of primary bile acid taurocholic acid
(TCA), followed by primary bile acid tauro-β-muricholic
acid (T-β-MCA), and secondary
bile acid tauro-ω-muricholic acid (T-ω-MCA)
(Fig. 4E). ABX treatment did not affect liver
TCA but significantly increased primary bile
acids T-β-MCA and β-MCA. Gut commensal bacteria
convert primary bile acids into secondary
bile acids (28). As expected, secondary bile acids
T-ω-MCA, taurodeoxycholic acid (TDCA), ω-MCA,
taurolithocholic acid (TLCA), and tauroursodeoxycholic
acid (TUDCA) were reduced in ABX-treated
mice (Fig. 4E).

To connect these findings, we studied the
effect of bile acids on CXCL16 expression in
primary murine LSECs in vitro. Secondary bile
acid ω-MCA decreased Cxcl16 mRNA expression,
whereas the primary bile acid T-β-MCA induced
Cxcl16 mRNA (Fig. 4F). Because cholestyramine
treatment reduces both primary and secondary
bile acids and caused NKT accumulation, we
tested the possibility that primary and secondary
bile acids have opposing effects on CXCL16 regulation.
Indeed, the secondary bile acid compromised
the primary bile acid-induced Cxcl16 mRNA
up-regulation and even reduced Cxcl16 mRNA
expression below the untreated baseline level
(Fig. 4G). When mice were fed ω-MCA, a secondary
bile acid, hepatic NKT accumulation was
reversed, whereas feeding chenodeoxycholic acid
(CDCA), a primary bile acid, enhanced NKT cell
accumulation (Fig. 4H). These data demonstrate
that primary and secondary bile acids influence
hepatic NKT cell recruitment and have opposing
effects in vivo.

Next, bile acids were fed to mice to investigate
the effect on liver tumor growth. As expected,
secondary bile acids LCA or ω-MCA reversed
the inhibition of liver tumor growth caused by
antibiotic treatment, and primary bile acid CDCA
further enhanced tumor inhibition (Fig. 4I and
fig. S5, F to H). Enforced CXCL16 expression in
the liver (Fig. 4I and fig. S5F) or depletion of
CD4+ NKT cells (fig. S5G) eliminated the effect
of secondary bile acids on liver tumor growth,
suggesting that
this effect was NKT cell mediated. Similarly,
secondary bile acids failed to affect liver tumor
development in liver-NKT-deficient (CXCR6-/-)
mice (Fig. 4J) but still preserved the function to
block the accumulation of transferred wild-type
NKT cells caused by antibiotic treatment (fig. S5I).
Therefore, we propose a model in which depleting
commensal bacteria with antibiotics preserves primary bile
acids that induce CXCL16 and reduces
secondary bile acids that inhibit CXCL16.
This leads to up-regulation of CXCL16

Fig. 3. Hepatic NKT cells mediate the inhibition of liver metastasis in mice. (A to D) B16 tumor cells
were intrasplenically injected into mice pretreated with ABX or H2O. One day before tumor injection,
mice were given intraperitoneal injection of a combination of antibodies to CD4 (anti-CD4) (500 μg per
mouse) and anti-CD8 (200 μg per mouse) or anti-CD8 alone (200 μg per mouse). Liver surface
metastatic nodules were counted. Representative images are shown. Data represent mean ± SEM
of two pooled experiments. n = 16 for H2O, 14 for ABX, 13 for anti-CD4 + anti-CD8, 5 for anti-CD8. P <
0.05, one-way ANOVA. (E and F) Loss of hepatic NKT abrogated the inhibition of liver metastasis
caused by ABX. CXCR6-/-, CD1d-/-, or wild-type mice pretreated with ABX or H2O were given EL4 tumor
cell tail vein injections. Liver surface metastatic nodules were counted. Data represent mean ± SEM
of two pooled experiments. n = 10 for Wt H2O, 7 for Wt ABX, 4 for CXCR6-/- H2O, 5 for CXCR6-/- ABX,
5 for CD1d-/- H2O, 4 for CD1d-/- ABX. P < 0.05, two-way ANOVA.


in LSECs and accumulation of NKT cells in 
the liver. 


Clostridium species regulates liver NKT 
accumulation in mice 


Although ABX treatment efficiently reduced gut 
commensal bacteria load, it did not result in 
complete elimination of gut bacteria (fig. S1A). 
To rule out the possibility that the remaining 
bacteria mediate NKT accumulation, we repeated 
the experiment in germ-free mice. Whereas more 
hepatic NKTs were found in germ-free mice when 
compared with the matched SPF control mice 
(Fig. 5A and fig. S5, J and K) (29), no change was 
seen in TLR4-knockout mice (fig. S5L). Similarly,
Cxcl16 mRNA levels were higher in the liver of
germ-free mice (Fig. 5B).

Next, we tried to identify the commensal bacteria
responsible for the observed liver NKT accumulation.
Because the ABX antibiotic cocktail
used here contains three antibiotics with
different activity spectra, individual antibiotic
treatment was performed to narrow down the
targeted bacteria. Vancomycin alone was sufficient
to increase hepatic NKT cells, whereas
neomycin had a marginal effect (Fig. 5C and
fig. S5M). A clear increase of liver NKT cells was
seen in mice fed with cefoperazone, but no significant
changes in hepatic CD4+ or CD8+ T cells
were observed (fig. S5, N and O). Interestingly,

ABX or H2O. CXCL16 mRNA levels were measured
by real-time polymerase chain reaction (PCR). Data
represent mean ± SEM of three pooled experiments.
n = 9, P < 0.05, two-way ANOVA. (C) Immunohistochemistry
staining of CXCL16 and LYVE-1 in liver sections from ABX- or H2O-treated
mice. CXCL16+ (red)/LYVE+ (green) LSECs are highlighted by arrows
and are shown in more detail in the insert. Scale bar, 50 μm. (D) Mice were
fed with a 2% cholestyramine (CHOL) or control diet (Con). Hepatic NKT,
CXCR6+, CD4 T, and CD8 T cells were measured. Data represent mean ± SEM
of two pooled experiments. n = 5 for control, 9 for cholestyramine diet.
P < 0.05, two-way ANOVA. (E) Liver bile acid profile of ABX- or H2O-treated
mice. Data represent mean ± SEM of two pooled experiments. n = 9, P < 0.05,
Student's t test. (F and G) Isolated LSECs were treated with different bile
acids (F) or a combination of T-β-MCA with T-ω-MCA or ω-MCA (G). Data
represent mean ± SEM of two pooled experiments. n = 5, P < 0.05, one-way
ANOVA. (H) ABX-treated mice were given oral gavage of
CDCA or ω-MCA three times at a dose of 6 mg per 15 g body weight.
Hepatic NKT cell levels from different treatments were measured. Data
represent mean ± SEM of two pooled experiments. n = 5, P < 0.05, two-way
ANOVA. (I) A20 liver metastasis in vancomycin (Vanco)- or H2O-treated
mice receiving LCA or CDCA. As indicated, one group of LCA-treated mice
also received hydrodynamic injection to force CXCL16 expression in the
liver. Data represent mean ± SEM of two pooled experiments. n = 15 for
Vanco + LCA, others n = 10. P < 0.05, one-way ANOVA. (J) EL4 liver
metastasis in ABX-treated CXCR6-/- mice with or without ω-MCA feeding.
Data represent mean ± SEM of two pooled experiments. n = 5.

both vancomycin and cefoperazone target gram-positive
bacteria and have been reported to increase
primary bile acids and deplete secondary
bile acids in the gut (30), which is consistent 
with our finding that bile acids also change in 
the liver and regulate NKT cell accumulation. 

The 7α-dehydroxylation reaction is the most
quantitatively important process performed 
by the gut bacteria in the production of sec- 
ondary bile acids (28) and is restricted to gram- 
positive bacteria of the Clostridium cluster XIV 
(31). We found that vancomycin, which increased 
liver NKT cells, depleted Clostridium, whereas 


Fig. 5. Clostridium species influence bile acid-controlled NKT cell accumulation.
(A) Hepatic NKT cell levels from germ-free mice or matched SPF mice were
measured. Data represent mean ± SEM of two pooled experiments.
n = 5, P < 0.05, Student's t test. (B) Cxcl16 mRNA expression in liver
tissue from germ-free or SPF mice. Data represent mean ± SEM of
two pooled experiments. n = 8 for SPF, 7 for GF. P < 0.05, Student's
t test. (C) Naive C57BL/6 mice were fed with vancomycin (Vanco),
neomycin (Neo), or cefoperazone (Cefo). Hepatic NKT levels were
determined. Data represent mean ± SEM of three pooled experiments.
n = 18 for H2O, 14 for vancomycin, 14 for neomycin, 10 for cefoperazone.
P < 0.05, one-way ANOVA. (D and E) Mice were treated with vancomycin
for 1 week and then gavaged with C. scindens or vehicle (cessation).
Twenty-four hours after C. scindens gavage, 16S rRNA sequencing analysis
of stool samples was performed. The relative abundance of OTUs in
the fecal bacteria is shown (D). Time-course study of hepatic NKT
levels was performed (E). Data represent mean ± SEM of two pooled
experiments. n = 10 for H2O, D0, C. scindens D4, and Cessation D4;
5 for C. scindens D2, Cessation D2, C. scindens D7, and Cessation D7.
P < 0.05, two-way ANOVA. (F and G) A20 liver tumors were induced in
mice treated with vancomycin or H2O. Mice were colonized with
C. scindens or control C. innocuum as illustrated in (F). Cumulative liver
tumor counts are shown in (G). Data represent mean ± SEM of two
pooled experiments. n = 10 for H2O and Vanco, n = 20 for C. scindens
and C. innocuum. P < 0.05, one-way
ANOVA. (H) SK-HEP1 cells were

neomycin, which had little effect on liver NKT,
had little effect on Clostridium (fig. S5P). To
explore a role for Clostridium species in liver
NKT cell accumulation, we used C. scindens,
which is commonly found in both mice and humans
(32) and has a conserved bai (bile acid inducible)
gene operon for the 7α-dehydroxylation
reaction (28, 33). Mice were fed with vancomycin
for 1 week to induce hepatic NKT cells, then vancomycin
was stopped, and the mice were given
C. scindens or vehicle (fig. S6A). The successful
colonization with C. scindens was confirmed (fig.
S6B). One day after oral gavage, fecal bacteria
were analyzed by 16S ribosomal RNA (rRNA) sequencing
(Fig. 5D and fig. S6, C and D). Continuous
vancomycin treatment caused a reduction of
Clostridiales and Bacteroidales but an expansion
of Verrucomicrobiales compared with the water
control group. An early recovery of Bacteroidales
was observed after vancomycin cessation. Gavage
of C. scindens increased the Clostridiales population.
A time course study showed that hepatic
NKT levels started to drop between day 2 and
day 4 after vancomycin withdrawal (Fig. 5E),
suggesting recovery of gut commensal bacteria.


treated with different bile acids. CXCL16 mRNA levels were measured by real-time PCR. Data represent mean ± SEM of three pooled experiments. n > 10,
P < 0.05, one-way ANOVA. (I) Correlation between primary bile acid CDCA and CXCL16 mRNA expression in nontumor liver tissues from hepatocellular
carcinoma or cholangiocarcinoma patients of the TIGER cohort. Pearson correlation coefficient test was performed.

Clostridium cluster XIV recovered in parallel
(fig. S6E). Colonization with C. scindens induced
a rapid reduction of hepatic NKT cells (Fig. 5E) 
but did not affect other immune cells (fig. S6, F 
and G). As expected, colonization of mice with 
C. scindens, but not cessation of ABX treatment, 
resulted in a reduction of primary bile acids and 
recovery of secondary bile acids on day 2 (fig. 
S6H). Thus, our results suggest that bile acid- 
converting Clostridium species such as C. scindens 
are involved in the regulation of hepatic NKT 
cell accumulation. We next tested the effect of 
C. scindens on liver tumor growth. As expected, 
more liver tumors were found in C. scindens-colonized
mice than in mice kept on vancomycin or
C. innocuum-colonized mice (Fig. 5, F and G), suggesting
that gut bacteria capable of metabolizing
bile acids can regulate growth of liver tumors.


Bile acids control liver CXCL16 
expression in humans 


We extended our study to human samples to determine
the effect of bile acids on CXCL16 mRNA
expression of human liver sinusoidal endothelial
cells. Similar to the mouse study, primary bile
acids CDCA and TCA induced CXCL16 mRNA
expression in SK-HEP1 cells (Fig. 5H). Next, the
correlation between bile acids and CXCL16 expression
was tested in nontumor liver tissues
from HCC or cholangiocarcinoma patients of the
Thailand Initiative for Genomics and Expression
Research in Liver Cancer (TIGER-LC) cohort (34).
Primary bile acid CDCA levels correlated with
CXCL16 expression (Fig. 5I), whereas inverse
correlation was seen for the secondary bile acid
glycolithocholate (GLCA) (fig. S6I). The primary/secondary
ratio was associated with CXCL16 increase
(fig. S6J), indicating that the opposing
effect of bile acids on CXCL16 expression also
exists in humans.
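
As a rough illustration of the correlation test named above, the following sketch computes a Pearson correlation coefficient between paired measurements. It is not the authors' analysis code: the function name `pearson_r` is hypothetical, and the numbers in the usage lines are made-up placeholders, not TIGER-LC data.

```python
import math


def pearson_r(x, y):
    """Return the Pearson correlation coefficient of two paired samples.

    x and y are equal-length sequences of numbers, for example a bile acid
    level and a CXCL16 expression value per tissue sample (hypothetical use).
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    if len(x) < 2:
        raise ValueError("at least two paired observations are required")

    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)

    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)

    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / math.sqrt(sxx * syy)


# Placeholder values only: a perfectly increasing and a perfectly
# decreasing relationship, to show the sign convention.
increasing = pearson_r([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
decreasing = pearson_r([1.0, 2.0, 3.0, 4.0], [8.0, 6.0, 4.0, 2.0])
```

A positive coefficient corresponds to the direction reported for CDCA, and a negative coefficient to the direction reported for GLCA; a significance test on the coefficient would be a separate step.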

Mucosal-associated invariant T (MAIT) cells are
enriched in the liver and comprise 20 to 50% of 
hepatic lymphocytes in humans (35). Interestingly, 
CXCR6 is expressed on MAIT cells (35), which 
raises the question whether liver MAIT cells can 
also be controlled by gut bacteria through CXCL16
regulation. MAIT cells recognize bacterial deriv- 
atives and are involved in inflammatory liver 
diseases such as nonalcoholic steatohepatitis 
(NASH). Besides antitumor function, NKT cells 
have been reported as important regulators of 
autoimmune responses (36). Our study suggests 
that gut commensal bacteria and bile acids could 
be potential targets for controlling liver autoimmune
diseases. In summary, we have described
a mechanism by which the gut microbiome uses 
bile acids as messengers to control the accumu- 
lation of hepatic NKT cells and thereby antitumor 
immunity in the liver of mice. These findings not 
only have possible implications for future cancer 
therapeutic studies in humans but also provide a 
link between the gut microbiome, its metabolites, 
and immune responses in the liver. 


Materials and methods 
Murine studies 


SPF C57BL/6 and BALB/c mice were purchased
from Charles River. CXCR6-knockout mice were
purchased from The Jackson Laboratory. CD1d-knockout
mice and LAP-tTA and TRE-MYC mice have been previously
described (10). Germ-free mice were provided
by R. Goldszmid (Cancer and Inflammation
Program, NIH). Newly purchased 4-week-old
C57BL/6 or BALB/c mice were randomized into
cages of five mice and housed for 1 week to normalize
the gut microbiome. Mice were then assigned to
H2O or ABX treatment groups. Mice in the ABX
group received a three-antibiotic cocktail in the
drinking water containing vancomycin (Hospira,
0.5 g/L), neomycin (VETone, 0.5 g/L), and primaxin
(Merck & Co., 0.5 g/L) as previously reported
(3). In some experiments mice were given
water containing a single antibiotic, and cefoperazone (MP
Biomedicals) was given at a concentration of
0.5 g/L. Antibiotic water was replaced with a fresh
preparation every other day. After 3 weeks of ABX pretreatment,
mice were challenged with different tumor cell
lines. B16-F1 and A20 cells were purchased from
ATCC. EL4 cells were used as described (37). 1 x
10° EL4 tumor cells were given by subcutaneous or tail
vein injection, 3 x 10° B16-F1 tumor cells were
given by intrasplenic injection as described before
(11), and 1 x 10° A20 tumor cells were given
by tail vein injection. In some experiments mice
were fed with a 2% cholestyramine diet made by
Research Diets Inc. (New Brunswick, NJ). Mice
were treated with 500 μg anti-CD4 (clone GK1.5,
BioxCell) or 200 μg anti-CD8 (clone 2.43, BioxCell)
24 hours before receiving tumor injection for
depletion studies. For in vivo NKT cell stimulation,
1 x 10° αGalCer-loaded A20 tumor cells in
combination with brefeldin A (500 μg/mouse)
were given by tail vein injection, and mice were
sacrificed 3 hours after injection. αGalCer loading
was performed by incubating A20 cells with 1 μg/ml
αGalCer overnight, followed by three washes.
At the experimental end points, mice were
sacrificed for organ harvest. All experiments
were conducted according to local institutional guidelines
and approved by the Animal Care and Use
Committee of the National Institutes of Health,
Bethesda, USA.
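
The antibiotic concentrations above translate directly into the mass of each drug needed per water bottle. The sketch below is only an arithmetic aid under stated assumptions: the concentrations (0.5 g/L each) are taken from the text, whereas the function name `antibiotic_mass_mg` and the 250 ml bottle volume are hypothetical and not from the source.

```python
# Concentrations as stated in the methods text (g per liter of drinking water).
ABX_COCKTAIL_G_PER_L = {
    "vancomycin": 0.5,
    "neomycin": 0.5,
    "primaxin": 0.5,
}


def antibiotic_mass_mg(concentration_g_per_l, bottle_volume_ml):
    """Return the mass in mg needed to reach a concentration in one bottle.

    g/L multiplied by ml gives mg, because 1 g/L equals 1 mg/ml.
    """
    if concentration_g_per_l < 0 or bottle_volume_ml < 0:
        raise ValueError("concentration and volume must be non-negative")
    return concentration_g_per_l * bottle_volume_ml


# Hypothetical bottle size; the source does not state the volume used.
bottle_volume_ml = 250
masses_mg = {
    drug: antibiotic_mass_mg(conc, bottle_volume_ml)
    for drug, conc in ABX_COCKTAIL_G_PER_L.items()
}
```

Because the water is replaced every other day, the same calculation would be repeated for each fresh preparation.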


Flow cytometry 


Cells were surface-labeled with the indicated 
antibodies for 15 min at 4°C. Intracellular staining
was performed using a Foxp3/transcription factor staining
buffer set (eBioscience) according to
the manufacturer’s instructions. Flow cytometry
was performed on a BD LSRFortessa platform, and
results were analyzed using FlowJo software version
9.3.1.2 (TreeStar). Dead cells were excluded by
using a live/dead fixable near-IR dead cell staining
kit (Thermo Fisher Scientific). The following antibodies
were used for flow cytometry analysis:
anti-TCRβ-BV510 (clone H57-587, Biolegend),
PBS57/CD1d-tetramer-APC (NIH core facility),
anti-CXCR6-FITC (clone SA051D1, Biolegend),
anti-CD3-PE (clone 17A2, Biolegend), anti-CD4-PE
(clone RM4-5, Biolegend), anti-CD4-Alexa
Fluor 700 (clone GK1.5, Biolegend), anti-CD8-BV210
(clone 53-6.7, Biolegend), anti-CD19-PerCP/Cy5.5
(clone eBio1D3, eBioscience), anti-CD49b
(clone DX5, eBioscience), anti-TCRγ/δ-PE (clone
GL3, BD Pharmingen), anti-CD11b-BV421 (clone
M1/70, Biolegend), anti-Ly6G-Alexa Fluor 700
(clone 1A8, Biolegend), anti-Ly6C-AP (clone HK1.4,
Biolegend), anti-CD44-PE/Cy7 (clone IM7, eBioscience),
anti-CD62L-PerCP/Cy5.5 (clone MEL-14, Biolegend),
anti-CD69-Pacific Blue (clone H1.2F3,
Biolegend), anti-CD25-FITC (clone 7D4, BD Pharmingen),
anti-4-1BB-PE (clone 17B5, Biolegend),
anti-Foxp3-Alexa Fluor 488 (clone 22F6, Biolegend),
anti-Tbet-Pacific Blue (clone 4B10, Biolegend),
anti-RORγ-PE (clone B2D, eBioscience),
anti-PLZF-PerCP/Cy5.5 (clone 9E12, Biolegend),
anti-CD1d-PE (clone 1B1, eBioscience),
anti-IFNγ-PE (clone XMG1.2, BD Biosciences), and
anti-TNFα-PerCP/Cy5.5 (clone MP6-XT22, Biolegend).
The following markers were used for identifying
different immune cell subsets: TCRβ+CD1d-
Tetramer+ for NKT cells, CD3+CD4+ for hepatic
CD4+ T cells, CD3+CD8+ for CD8+ T cells, CD3−
CD19+ for B cells, CD3−CD49b+ for NK cells,
CD3+TCRγ/δ+ for γ/δ T cells, CD11b+Ly6G+Ly6Clo
for G-MDSC. Absolute numbers were calculated
by multiplying frequencies obtained from flow
cytometry by the total live mononuclear cell
count and then dividing by liver weight.
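
The absolute-number calculation described above can be sketched in a few lines. This is a minimal illustration, not the authors' analysis script: the function name, the assumption that frequencies are expressed as percentages of live mononuclear cells, and the example numbers are all hypothetical.

```python
def cells_per_gram(frequency_percent, total_live_cells, liver_weight_g):
    """Absolute number of a cell subset per gram of liver.

    frequency_percent: subset frequency from flow cytometry, assumed here
                       to be a percentage (0-100) of live mononuclear cells.
    total_live_cells:  total live mononuclear cell count for the liver.
    liver_weight_g:    liver weight in grams.
    """
    if not 0.0 <= frequency_percent <= 100.0:
        raise ValueError("frequency_percent must be between 0 and 100")
    if liver_weight_g <= 0:
        raise ValueError("liver_weight_g must be positive")
    # frequency x total count gives cells in the subset; dividing by
    # liver weight normalizes for differences in organ size.
    return (frequency_percent / 100.0) * total_live_cells / liver_weight_g


# Hypothetical example values, chosen only to show the arithmetic.
example = cells_per_gram(20.0, 2_000_000, 1.25)
```

Normalizing by liver weight makes counts comparable between mice whose livers differ in size, for example because of tumor burden.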


In vivo cytotoxicity assay


Splenocytes isolated from naive C57BL/6 mice
were loaded with αGalCer (1 µg/ml) and then labeled
with a high dose of CFSE as target cells. Unloaded
cells were labeled with a low dose of CFSE as control
cells. CFSEhi target cells and CFSElo control
cells were then mixed at a ratio of about 1:1. 10⁷ mixed
cells were injected intravenously into ABX- or
H2O-treated C57BL/6 mice. Sixteen hours later,
mice were killed and cytotoxicity was analyzed
by flow cytometry. r = %CFSElo / %CFSEhi; r0
is the ratio of mixed cells without injection; %
cytotoxicity = [1 − (r0/r)] × 100.
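
The cytotoxicity formula above can be written out as a short function. This is a sketch under stated assumptions: the function and argument names are hypothetical, and the example percentages are invented purely to show the arithmetic.

```python
def percent_cytotoxicity(pct_cfse_lo, pct_cfse_hi, r0):
    """Percent specific killing of CFSE-high target cells.

    pct_cfse_lo: % of recovered cells that are CFSE-low (unloaded controls).
    pct_cfse_hi: % of recovered cells that are CFSE-high (loaded targets).
    r0:          the same lo/hi ratio measured in the mixed cells
                 before injection.
    """
    if pct_cfse_lo <= 0 or pct_cfse_hi <= 0 or r0 <= 0:
        raise ValueError("all inputs must be positive")
    r = pct_cfse_lo / pct_cfse_hi      # ratio after in vivo exposure
    return (1.0 - r0 / r) * 100.0      # 0 when the ratio is unchanged


# Hypothetical example: a 1:1 input mix (r0 = 1.0) recovered as 60% control
# and 40% target cells gives r = 1.5, so one-third of targets were lost.
killing = percent_cytotoxicity(60.0, 40.0, 1.0)
```

Because target loss raises the lo/hi ratio, r grows above r0 as killing increases, and the expression approaches 100% as targets disappear.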


Immunohistochemistry and quantification 


Immunohistochemistry was performed on 3 µm
sections obtained from formalin-fixed, paraffin-embedded
liver tissues of H2O-treated (n = 5) or
ABX-treated (n = 5) mice using the Opal 5-color
IHC Kit (PerkinElmer, Waltham, MA, USA) according
to the manufacturer’s instructions. The
following primary antibodies were used: anti-CXCL16
(Bioss Antibodies, Woburn, MA, USA; bs-1441R,
rabbit polyclonal, 1/4,000, Opal 620) and anti-LYVE1
(Abcam, Cambridge, UK; rabbit polyclonal,
1/15,000, Opal 520). Slides were evaluated using
the Vectra® 3 automated, high-throughput quantitative
pathology imaging system (PerkinElmer)
and the inForm software (PerkinElmer) for segmentation
and quantification of CXCL16+/LYVE1+ cells.


Hepatic bile acid profiling 

Fresh mouse liver tissue was snap frozen in liquid
nitrogen and then kept at -80°C. Hepatic bile acid
composition was measured at the West Coast Metabolomics
Center at UC Davis using the targeted
metabolite analysis service.


Liver sinusoidal endothelial cell 
preparation and bile acids treatment 


Primary mouse liver sinusoidal endothelial cells (LSECs)
were isolated as previously described (38). Briefly,
mice were euthanized with CO2, the portal
vein was cannulated, and the liver was perfused
with 0.05% collagenase in Ca²⁺-deprived medium.
Liver cells were dissociated, and parenchymal cells
were killed by incubation in 0.04% collagenase in
Gey’s balanced salt solution at 37°C for 15 min.
Density gradient centrifugation was then performed
using Nycodenz solution at a final density
of 1.089 g/cm³. LSECs were isolated using
anti-LSEC microbeads (Miltenyi) according to the
manufacturer’s instructions. LSECs or the human
SK-HEP1 cell line (ATCC, HTB-52) were treated
with different bile acids for 24 hours. Gene expression
was analyzed by real-time PCR. TCA,
CDCA, DCA, and TDCA were purchased from
Sigma. T-β-MCA, ω-MCA, and T-ω-MCA were
purchased from Steraloids Inc.


In vivo bile acids feeding 


Mice were kept on the ABX cocktail, which was
replaced with a fresh preparation every other day. Mice were fed
bile acids by oral gavage 48, 24, and 16 hours
prior to sacrifice. For A20 tumor-bearing mice,
ω-MCA, LCA, or CDCA was given 3 times/week.
Bile acids were dissolved in corn oil. ω-MCA and
CDCA were given at a dose of 6 mg/15 g body
weight; LCA was given at a dose of 1 mg/15 g
body weight.
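
As a worked example of the dosing above, the per-mouse amount can be computed from body weight. This sketch assumes that the "mg per 15 g body weight" doses scale linearly with weight, which the text implies but does not state; the function name and the 20 g example mouse are hypothetical.

```python
def gavage_dose_mg(body_weight_g, dose_mg_per_15g):
    """Bile acid amount (mg) for one gavage.

    Assumes linear scaling of a dose quoted per 15 g of body weight.
    """
    if body_weight_g <= 0 or dose_mg_per_15g < 0:
        raise ValueError("body weight must be positive and dose non-negative")
    return dose_mg_per_15g * body_weight_g / 15.0


# Doses per 15 g body weight as given in the text.
DOSE_6_MG = 6.0   # used for the 6 mg/15 g bile acids
DOSE_1_MG = 1.0   # used for LCA

# Hypothetical 20 g mouse.
high_dose = gavage_dose_mg(20.0, DOSE_6_MG)
lca_dose = gavage_dose_mg(20.0, DOSE_1_MG)
```

Under this assumption a 20 g mouse would receive 8 mg at the higher dose and about 1.33 mg of LCA per gavage.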


Adoptive transfer of NKT cells 


Donor NKT-rich cells were isolated from livers of
wild-type C57BL/6 mice by autoMACS sorting of
NK1.1+ cells. Half a million NKT-rich cells were
transferred into CXCR6−/− mice by tail vein
injection. Two days later, mice were sacrificed and
the liver accumulation of transferred NKT cells
was measured by flow cytometry as the TCRβ+CD1d-
Tetramer+CXCR6+ population.


Gut colonization with 
Clostridium scindens 


Mice were given vancomycin in drinking water
(Hospira, 0.5 g/L) for one week. Antibiotic
water was replaced with a fresh preparation every other day. After one
week, vancomycin was stopped, and the mice
were given 10° C. scindens or vehicle
(anaerobic glycerol) by oral gavage every day for 5 days.
C. scindens was purchased from ATCC (35704)
and grown under anaerobic conditions. One
day after gavage, colonization with C. scindens
was confirmed by real-time PCR using primers
specific for C. scindens. For the A20 tumor study,
BALB/c mice were given vancomycin for one
week. Vancomycin was then stopped, and A20
tumor cells (1 x 10°) were injected intravenously.
Mice were given 10° C. scindens or
C. innocuum (ATCC 14501) by oral gavage every day for 4 days.
Mice were then given a second round of vancomycin
treatment for 4 days, followed by an additional
4 days of oral gavage of C. scindens or
C. innocuum. Fourteen days after tumor injection,
mice were sacrificed and liver tumor burden was
measured.


16S rRNA sequencing and analysis 


Mouse stool DNA extraction and 16S V4 region
amplification were performed on liquid-handling
robots (Eppendorf, epMotion 5075 and epMotion
5073). The V4 region of the 16S rDNA gene
(515F-806R) was sequenced, generating partially
overlapping, paired-end reads on the Illumina
MiSeq platform. After quality control filtering,
a total of 3,979,728 reads were processed with an
average of 132,657 reads per sample. The demul- 
tiplexed FASTQ files containing the 16S rRNA 
gene sequences were filtered for chimeric sequen- 
ces using the USEARCH (version 8.1.1831) utility's 
UCHIME implementation and the ‘gold’ database 
(version microbiomeutil-r20110519). The reads 
were then binned into Operational Taxonomic 
Units (OTUs) at 97% similarity using USEARCH’s 
cluster_otus command. The OTUs thus obtained 
were classified and aligned using QIIME (1.9.1)
scripts. The assign_taxonomy.py script was used 
to assign taxonomy using the default RDP method 
and the default GreenGenes database. This pro- 
vided insight into the larger trends at higher 
taxonomic levels (such as order Clostridiales). 
The 16S rRNA sequencing data were deposited
into the Sequence Read Archive (SRA) public database
with the accession number SRP136953.


Human studies 


Nontumor specimens derived from a set of 142
patients of the TIGER-LC Consortium were used
in this study (34). Transcript expression was measured
using the Affymetrix Human Transcriptome
Array 2.0. Data have been deposited into the
Gene Expression Omnibus (GEO) public database
at NCBI (GEO Series GSE76297). A total of 718
biochemical metabolite species were measured by
Metabolon's DiscoveryHD4 platform. All expression
and metabolite data were log2 transformed.
Pearson correlation analysis was performed using
GraphPad Prism 7 to determine the correlation between
CXCL16 gene expression and selected
metabolites. Because of missing information
caused by detection limits, data from 85 patients
with valid measurements were used to correlate
CDCA and CXCL16 expression.
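
The analysis described above (log2 transformation, exclusion of patients with missing values, then Pearson correlation) was done in GraphPad Prism. A rough equivalent can be sketched in plain Python; the function names, the use of `None` for missing measurements, and the toy numbers are assumptions for illustration only and are not patient data.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def correlate_valid_pairs(expression, metabolite):
    """Log2-transform and correlate, skipping patients with missing data.

    Missing values are represented as None (an assumption of this sketch).
    Returns (r, number_of_valid_pairs).
    """
    pairs = [
        (e, m)
        for e, m in zip(expression, metabolite)
        if e is not None and m is not None and e > 0 and m > 0
    ]
    x = [math.log2(e) for e, _ in pairs]
    y = [math.log2(m) for _, m in pairs]
    return pearson_r(x, y), len(pairs)


# Toy data: the last patient lacks an expression value and is dropped.
r, n_valid = correlate_valid_pairs([1, 2, 4, 8, None], [2, 4, 8, 16, 5])
```

Dropping incomplete pairs before correlating mirrors the reduction from 142 specimens to 85 patients with valid CDCA measurements.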


Statistical analysis 


The sample sizes for animal studies were guided 
by previous murine studies in our laboratory. 
Statistical analysis was performed with GraphPad
Prism 7 (GraphPad Software). Significant
differences between groups were assessed by
Student’s unpaired t test or by one-way or two-way
ANOVA (Tukey’s and Bonferroni’s multiple comparison
tests). Welch’s correction was used when
variances between the groups were unequal. P <
0.05 was considered statistically significant.
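
For readers unfamiliar with Welch’s correction, the statistic it produces can be sketched as follows. The study used GraphPad Prism, so this is only an illustration of the standard unequal-variance t test; the function name and the example data are hypothetical, and the P value lookup from the t distribution is omitted.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Welch's unequal-variance t statistic and degrees of freedom.

    Returns (t, df), where df comes from the Welch-Satterthwaite
    approximation. At least one group must have nonzero variance.
    """
    n1, n2 = len(a), len(b)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two observations")
    # Squared standard errors of the two group means.
    se1, se2 = variance(a) / n1, variance(b) / n2
    if se1 + se2 == 0:
        raise ValueError("both groups have zero variance")
    t = (mean(a) - mean(b)) / math.sqrt(se1 + se2)
    df = (se1 + se2) ** 2 / (se1 ** 2 / (n1 - 1) + se2 ** 2 / (n2 - 1))
    return t, df


# Hypothetical groups with clearly unequal variances.
t_stat, dof = welch_t([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
```

Unlike Student’s test, the degrees of freedom here are generally non-integer and shrink toward the smaller, noisier group, which makes the test more conservative when variances differ.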
PALEOCLIMATE


A 550,000-year record of East Asian 
monsoon rainfall from ¹⁰Be in loess


J. Warren Beck, Weijian Zhou, Cheng Li, Zhenkun Wu, Lara White,
Feng Xian, Xianghui Kong, Zhisheng An


Cosmogenic ¹⁰Be flux from the atmosphere is a proxy for rainfall. Using this proxy, we
derived a 550,000-year-long record of East Asian summer monsoon (EASM) rainfall from
Chinese loess. This record is forced at orbital precession frequencies, with higher rainfall
observed during Northern Hemisphere summer insolation maxima, although this response
is damped during cold interstadials. The ¹⁰Be monsoon rainfall proxy is also highly
correlated with global ice-volume variations, unlike Chinese cave δ¹⁸O, which is
only weakly correlated. We argue that both EASM intensity and Chinese cave δ¹⁸O are
not governed by high-northern-latitude insolation, as suggested by others, but rather by
low-latitude interhemispheric insolation gradients, which may also strongly influence
global ice volume via monsoon dynamics.


Chinese speleothem ¹⁸O/¹⁶O (δ¹⁸O) isotope
ratio records (1, 2), such as the Sanbao
Cave record, are clearly linked to the Asian
monsoon and provide some of the
highest-resolution and best-dated records of
tropical paleoclimate available. Their extraordinary
match with high-latitude (65°N) summer
insolation variations suggests a strong coupling
with high-latitude climate (1); however, the origins
of this coupling and of the δ¹⁸O variations
themselves remain controversial. Wang et al. (2)
originally interpreted Chinese speleothem δ¹⁸O
as reflecting changes in the mixing ratio of
summer versus winter monsoon moisture, each
differing in isotopic composition. Subsequent
authors have instead suggested that cave δ¹⁸O
variations were a response to changing upstream
Rayleigh fractionation (3) or changes in the seasonality
of precipitation (4), or were a proxy for
monsoon intensity (1). Still others (5) assert that
cave δ¹⁸O cannot be strictly a proxy for Asian
monsoon intensity because, according to them,
intensity should be phase-lagged from 65°N solar
insolation. Thus, the question of what
controls cave δ¹⁸O remains unresolved. Here, we
reinterpret these cave records in the context
of our ¹⁰Be-based East Asian summer monsoon
(EASM) rainfall record.

Meteoric ¹⁰Be preserved in Pleistocene Chinese
loess can be used as a proxy for monsoon paleorainfall.
Our sampling site (N34.43°, E107.12°) was
chosen in order to study the paleo-EASM, which
dominates the regional climate there, providing
~85% of annual rainfall. ¹⁰Be is a long-lived
cosmogenic radionuclide produced in the atmosphere;
it does not form gaseous species but
instead attaches to atmospheric dust, with deposition
mainly mediated by wet precipitation
events (6, 7). Its flux can thus be quantitatively
linked with rainfall amount, after correcting for
geomagnetic field and recycling effects (supplementary
materials). Once on the ground, ¹⁰Be becomes
an immobile oxidized component of the
sediments, permitting extraction of a rainfall
record with minimal signal loss from diagenesis.
Direct comparison with the nearby U/Th-dated
Sanbao Cave δ¹⁸O record (1), which is also reportedly
a record of monsoon strength, was made
possible by establishing an age model for our
record by correlating the large number of similar
features observed in both records.


Main findings 

Shown in Fig. 1A is our ¹⁰Be-proxy rainfall record
(Fig. 1A, red curve) compared with the Sanbao
δ¹⁸O record (Fig. 1A, yellow curve) (1). ¹⁰Be reveals
that EASM rainfall increases during Northern
Hemisphere summer insolation maxima and
decreases during insolation minima (Fig. 1A, gray
dashed curve) (8). Fig. 1A shows that EASM rainfall
intensity is inversely correlated with speleothem
δ¹⁸O; however, the scaling deviates during
the cold interstadials of marine isotope stages (MIS)
3, 6, 8, 10, and 12, where our ¹⁰Be-rainfall
proxy has lower amplitude than that of the cave
isotopes. As a result, we found that only 16.3%
of the variance in the ¹⁰Be-proxy rainfall record is
correlated with the Sanbao record (fig. S1A).
Nevertheless, both exhibit strong forcing at precessional
periods.

Fig. 1. ¹⁰Be-proxy EASM rainfall

On the other hand, our ¹⁰Be-rainfall record is
strongly correlated with global ice volume proxies. 
Shown in Fig. 1B is our record plotted against the 
Red Sea (RSL) paleosea level reconstruction (8), 
whereas in Fig. 1C we compare it with the LR04 
benthic foraminifera δ¹⁸O record (9). Our rainfall
record exhibits very similar structure, with re- 
spectively 57 and 51.7% of its variance reflected 
in each of these two global ice volume proxies 
(fig. S1, B and C). This suggests that EASM in- 
tensity is closely coupled to high-latitude ice vol- 
ume variations by some mechanism. 
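
To put the shared-variance percentages quoted above on a more familiar scale, they can be converted to correlation magnitudes. This sketch assumes that "percent of variance correlated" means the squared Pearson coefficient from a linear regression, which the text does not state explicitly; the function name is hypothetical, and the sign of each correlation cannot be recovered this way.

```python
import math


def correlation_magnitude(variance_explained_percent):
    """|r| implied by a shared-variance percentage (assumed to be 100 * r**2)."""
    if not 0.0 <= variance_explained_percent <= 100.0:
        raise ValueError("percentage must be between 0 and 100")
    return math.sqrt(variance_explained_percent / 100.0)


# Percentages quoted in the text for the rainfall record versus each series.
shared_variance = {
    "Sanbao Cave record": 16.3,
    "Red Sea sea level": 57.0,
    "LR04 benthic stack": 51.7,
}
magnitudes = {name: correlation_magnitude(p) for name, p in shared_variance.items()}
```

Under that assumption, the rainfall record correlates with the two ice-volume proxies at roughly |r| = 0.7 to 0.75, but with the cave record at only about |r| = 0.4.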

Surprisingly, this is not true of the Sanbao Cave 
δ¹⁸O record, which is highly correlated with 65°N
summer insolation, but only 10.6% of its variance 
is correlated with sea-level or global ice volume 
amplitude (Fig. 2). This creates a conundrum: 
Although cave δ¹⁸O and ¹⁰Be monsoon proxies are
modestly correlated with each other, the former 
is highly correlated with insolation but not ice 
volume, whereas the latter is highly correlated 
with ice volume but not insolation. The Chinese 
cave community asserts that its isotope records 
imply that Asian monsoon intensity is controlled 
by orbital forcing of high-northern-latitude in- 
solation (1, 10); but if so, how can their proxy for 
monsoon intensity also be essentially uncorre- 
lated with global ice volume? This seemingly 
contradicts a principal tenet of the Milankovitch 
theory, that high-northern-latitude solar insola- 
tion controls global ice volume and, through ice 
albedo feedback, also controls global temper- 
ature and, by corollary, tropical monsoon strength. 
How do we reconcile these two monsoon rec- 
ords, and can we do so without violating the 
Milankovitch theory? 


Interpretation 


We suggest that these conflicts may in part be 
resolved by reinterpreting the cave isotopes as a 
two-component moisture-mixing proxy instead 
of monsoon intensity and by recognition that 
monsoon intensity is determined mainly by low- 
latitude, not high-latitude, insolation variations. 
Why, then, are Chinese cave δ¹⁸O records so strongly
correlated with 65°N summer insolation (Fig. 3A)? 
The reason is that 65°N insolation exhibits nearly 
identical phase, net range, and pattern of variations
to those of the low-latitude (30°N to 30°S)
June solar insolation gradient (Fig. 3B). This find- 
ing partly resolves the conflict between the cave 
records and Milankovitch theory because a con- 
nection with ice volume (and ice albedo) is not 
required if monsoons are forced from low lati- 
tudes. Still, this observation weakens the concept 
that global climate control emanates principally 
from high-northern-latitude insolation. 

The idea that global climate control should 
chiefly emanate from low latitudes seems a more 
natural perspective because the amount of solar 
energy falling between 30°N and 30°S (that is, 
the region controlling the monsoons) is more 
than an order of magnitude larger than that fall- 
ing above 65°N. Furthermore, the monsoons are 
the primary agent by which this incident energy 
is transported aloft or to high latitude, where it 
can be reradiated to space. When coupled with 
our observation that the ¹⁰Be monsoon intensity
proxy is highly correlated with global ice volume, 
this model leads us to suggest that the mon- 
soons may in fact be an important driver of high- 
latitude climate, rather than the reverse. This 
model thus deviates from classical Milankovitch 
theory, although it retains the idea that orbital 
forcing of insolation is important. 
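
The energy argument above can be checked to first order from geometry alone. On a sphere, the fraction of surface area between two latitudes is (sin φ2 − sin φ1)/2. The sketch below compares only the areas of the two bands; it deliberately ignores the fact that insolation per unit area is also higher at low latitudes, so it gives a lower bound on the energy ratio. The function name is hypothetical.

```python
import math


def area_fraction(lat_south_deg, lat_north_deg):
    """Fraction of a sphere's surface lying between two latitudes (degrees)."""
    if not -90.0 <= lat_south_deg <= lat_north_deg <= 90.0:
        raise ValueError("latitudes must satisfy -90 <= south <= north <= 90")
    return (
        math.sin(math.radians(lat_north_deg))
        - math.sin(math.radians(lat_south_deg))
    ) / 2.0


low_latitude_band = area_fraction(-30.0, 30.0)   # 30°S to 30°N
polar_cap = area_fraction(65.0, 90.0)            # poleward of 65°N
area_ratio = low_latitude_band / polar_cap
```

The 30°S to 30°N band covers half of the globe, whereas the cap poleward of 65°N covers under 5%, so the area ratio alone already exceeds 10 before the stronger low-latitude insolation is taken into account.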

We argue that it is the low-latitude interhemispheric
insolation gradient that controls both
EASM intensity and Chinese cave δ¹⁸O. We assert
that this gradient modulates the pattern of
upper-tropospheric outflow from the Asian monsoons
to the North Pacific versus southern Indian
Ocean subtropical highs and in so doing regulates
the relative strength of Hadley and Walker
circulations in the Indo-Pacific sectors. This in
turn influences the relative strength of the Indian
summer monsoon (ISM) and Western North Pacific
summer monsoon (WNPSM). In this model,
Chinese cave isotopes are explained by mixing
between ¹⁸O-depleted moisture derived from the
ISM and relatively ¹⁸O-undepleted moisture nominally
derived from the WNPSM, both of which
contribute to EASM rainfall. ISM moisture is generally
more δ¹⁸O-depleted than WNPSM-sourced
moisture because of rainout during transport
across India or from greater interaction with
the topography of the Tibetan Plateau (TP) before
arrival in central China (11-13). Orbital forcing
of the interhemispheric insolation gradient, we
argue, results in a higher ISM (δ¹⁸O-depleted)
moisture fraction during boreal summer precession
maxima, and vice versa. When this gradient
is large, EASM rainfall amount also increases, although
the effect is damped during cold interstadials
because of persistent summer snow cover
on the TP or orbital modulation of Hadley circulation
strength (14).
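
The two-component mixing idea can be made concrete with a simple linear mass balance. This is a minimal sketch, not the authors' model: it assumes that the δ¹⁸O of EASM rainfall is the moisture-weighted average of two end members, and the end-member values used below (−12‰ for ISM-derived and −6‰ for WNPSM-derived moisture) are hypothetical numbers chosen only to illustrate the calculation.

```python
def mixed_delta(f_ism, delta_ism, delta_wnpsm):
    """delta-18O (permil) of a mixture with ISM moisture fraction f_ism."""
    if not 0.0 <= f_ism <= 1.0:
        raise ValueError("f_ism must be between 0 and 1")
    return f_ism * delta_ism + (1.0 - f_ism) * delta_wnpsm


def ism_fraction(delta_mix, delta_ism, delta_wnpsm):
    """Invert the mixing equation for the ISM moisture fraction."""
    if delta_ism == delta_wnpsm:
        raise ValueError("end members must differ to resolve a fraction")
    return (delta_mix - delta_wnpsm) / (delta_ism - delta_wnpsm)


# Hypothetical end members (permil); not values from the paper.
DELTA_ISM = -12.0
DELTA_WNPSM = -6.0

equal_mix = mixed_delta(0.5, DELTA_ISM, DELTA_WNPSM)      # halfway between
recovered = ism_fraction(equal_mix, DELTA_ISM, DELTA_WNPSM)
```

In this picture a shift toward more negative δ¹⁸O reflects a larger ISM moisture fraction rather than more rainfall as such, which is why the mixing proxy and the rainfall amount proxy need not track each other in amplitude.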


Modern evidence 


Modern observations (15) do in fact show that it 
is differential heating between the hemispheres 
at low latitudes—not high-latitude insolation— 
that controls tropical monsoon development. Mod- 
ern observations also show that the Asian mon- 
soon is linked to both meridional Hadley and 
zonal Walker circulation pathways via monsoon 
outflow to the southern Indian ocean subtropical 
high (SISH) and north Pacific subtropical high 
(NPSH) (16-19). These two subtropical highs are 
key elements of Hadley and Walker circulation, 
in which rising air generated by the tropical monsoons
descends in the subtropics. We assert that
orbital forcing of solar insolation can change the
ratio of monsoon outflow to these two regions
differentially, intensifying the subtropical anticyclone
in one sector while weakening it in the
other. This in turn modulates the relative strength
of the westerlies and trade winds in each sector,
with both intensifying in one sector at the expense
of the other owing to conservation of angular
momentum. But Indian Ocean trade winds
couple to the Somali jet (15, 19), which influences
ISM strength. North Pacific trade winds, on the
other hand, influence the transport of moisture
into central China from the WNPSM (20). Hence,
differentially modulating the trade wind strength
in these two sectors affects the mixing ratio
of moisture arriving in central China derived
from these two monsoons, and this, we
believe, is what regulates both cave δ¹⁸O and, to
a lesser extent, EASM intensity, as seen in our
¹⁰Be proxy.

Fig. 2. Correlation between Red Sea glacio-eustatic sea-level reconstruction and Sanbao Cave

That Hadley and Walker circulation in the
Indian and North Pacific Ocean sectors are inversely
coupled to each other via the pattern of
upper-tropospheric monsoon outflow (16, 17) is
well documented in the seasonal cycle of atmospheric
circulation (18, 21). Observations show
that during boreal summer, the southern Hadley
cell strengthens while the northern Hadley cell
weakens, mainly in response to intensification of
the Asian monsoons (figs. S2 to S4). These two
Hadley cells join at the intertropical convergence
zone (ITCZ), which is the rising limb of both
cells. During boreal summer, the ITCZ migrates
into the Northern Hemisphere (figs. S2 and S3) in
response to warmer northern and cooler Southern
Hemisphere tropics and subtropics (22). Over
Asia, this migration is accompanied by northward
migration of the upper-tropospheric subtropical
zonal westerly jet (23, 24), which moves
north of the TP during mid-summer. When it
does so, it enhances the Tibetan high circulation
pattern, which helps to redirect Asian monsoon
outflow to the southern Indian Ocean subtropical
high. At the same time, boreal summer heating
over the TP (~30°N) drives onshore air mass
transport, in which sensible heating from latent
heat release in the mid-troposphere strongly
drives convective and monsoon general circulation
(15), producing the world’s strongest center
of atmospheric circulation. During austral
summer, the ITCZ moves back into the Southern
Hemisphere and the subtropical jet returns to
the south of the TP as the heating centers reverse
hemisphere, strengthening the northern
and weakening the southern Hadley cells.
Thus, during boreal summer, northward move- 
ment of the ITCZ accompanied by heating over the 
TP strongly enhances northward cross-equatorial 
atmospheric mass transport in the lower tropo- 
sphere associated with Hadley circulation. This 
requires an upper-tropospheric return flow to the 
Southern Hemisphere for mass balance. When 
coupled with migration of the subtropical jet, these 
factors strengthen the overturning circulation in 
the southern Hadley cell and strengthen both 
the ISM and WNPSM (15, 19, 21, 23). Much of this 
lower tropospheric cross-equatorial mass flow 
occurs in the northwest Indian Ocean Somali jet, 
although a lesser part [~1/3 today (25)] occurs in 
several broad low-level flows north-northeast of 
Australia and the Bay of Bengal (Fig. 4). These two 
regions of low-level cross-equatorial flow are part 
of the reason that the southern Hadley cell completely
dominates the global meridional overturning
circulation during boreal summer (18, 21).
The other reason is that the southern subtropical
highs (~30°S) intensify during boreal summer in
response to colder austral winter (June-July-August)
temperatures. These southern highs (surface
anticyclones) form the descending limb of
the southern Hadley cell and are further intensified
by strong upper-tropospheric return flows of
cold dry air from the Northern Hemisphere that
balance the low-level northward monsoonal flows
during boreal summer (Fig. 4) (18, 19, 26). Cold
sea surface temperature (SST) in the southern
subtropics further enhances these anticyclones
by inhibiting evaporation from the sea surface,
whereas warmer SST in the northern subtropics
allows increased evaporation into the lower to
middle troposphere there, weakening those surface
highs (27).

Fig. 3. High-latitude versus low-latitude solar insolation gradient variations compared to Sanbao
Cave δ¹⁸O. (A) Plot of 65°N June solar insolation (dashed curve) (39) versus Sanbao Cave δ¹⁸O
record (solid curve) (1). The δ¹⁸O scale is reversed. (B) Plot of 65°N June solar insolation (dashed
curve) versus (30°N to 30°S) June solar insolation difference (solid curve). Visible are the nearly
identical pattern and phase and the similar magnitude of changes.

Thus, the seasonal cycle of differential inter- 
hemispheric heating affects where the monsoon 
outflow goes. Today, although some Asian mon- 
soon outflow falls into the Sahara and Asian 
interior, the remainder is mainly split between 
the NPSH and the SISH. Numerous modeling 
studies (16, 17), reanalysis of modern climate data 
(18, 19, 21), and Lagrangian tracer studies of
ozone (28) all show that this upper-tropospheric 
monsoon outflow to the Southern Indian Ocean 
is very strong during boreal summer and weak 
to the NPSH but reverses during austral sum- 
mer (fig. S4). These models also show that these 
strong outflows lead to intensified trade winds 
in the colder hemisphere (16, 29), where today
stronger Indian Ocean trade winds couple di- 
rectly to the Somali jet, intensifying the Indian 
Ocean tropical westerlies and the ISM (Fig. 4). 

Variations in this cyclic seasonal pattern of 
monsoon outflow also occur, and reanalysis of 
modern climate data reveals that these variations 
are linked to changes in SST. In particular, there 
is an antiphased SST coupling between the Indian 
and west central Pacific oceans so that when SST 
in the Indian ocean is anomalously warm, SST in 
the west central Pacific is anomalously cool (30). 
This pattern is associated with stronger WNPSM 
and stronger easterly trade winds in the west Pa- 
cific, but weaker tropical westerlies over the Indian 
ocean and weaker ISM, and vice versa. 

More generally, modern climate data show 
that strong Asian monsoon years are indeed 
associated with both anomalously strong north- 
ward movement of the ITCZ as well as anomalous 
cooling in the Southern Hemisphere subtropics 
(25, 31). At the same time, stronger 20th-century 
ISM correlates with weakening and eastward re- 
treat of the NPSH (20), which we argue is linked 
to weaker monsoonal outflow to the NPSH and 
stronger outflow to the SISH, as required by mass 
balance. 

Putting this all together, reanalysis of modern 
climate data shows that strong Asian summer 
monsoons produce strong upper-tropospheric 
outflow from the Asian monsoons to the SISH 
but weak outflow to the NPSH, and vice versa. It
substantiates the idea that there is an interhem- 
ispheric oscillation linking Indian and Pacific 
ocean SST to trade wind velocity over the North- 
west Pacific and Indian ocean sectors, and that 
the relative trade wind strength regulates the 
relative flux of moisture in the EASM from these 
two sectors. 
Here lies the linkage between cave δ¹⁸O and
the subtropical meridional insolation gradient.
Orbitally induced changes of the (June, 30°N
to 30°S) subtropical insolation gradient mimic
the seasonal cycle of low-latitude insolation var- 
iations that drive monsoons. These orbitally 
induced changes can thus either enhance or di- 
minish this seasonal cycle, depending on the 
phase. Modeling results (14, 32, 33) support this 
idea that orbital-induced changes of the low- 
latitude interhemispheric insolation gradient 
do strongly regulate the strength of the winter 
hemisphere Hadley cell, and that this may work 
via SST regulation of monsoon outflow to the 
subtropical highs (33) coupled with changes in 
the mean position of subtropical jet relative to 
the TP (23, 24). In our mechanism, these changes 
produce an “intertropical seesaw” between Hadley 
and Walker circulation that regulates the ratio of 
EASM moisture derived from the ISM versus 
WNPSM, modulating Chinese cave δ¹⁸O (Fig. 4).
The effect on EASM intensity, as told by the
¹⁰Be proxy, has the same sign and phase but is
weaker during cold interstadials because per- 
sistent summer snow cover on the TP inhibits 
onshore monsoon air mass transport and keeps 
the subtropical westerly jet positioned south of 
the TP, weakening the Tibetan high. 


High- and low-latitude coupling 


The observation that variation in the Asian mon- 
soon outflow to the SISH versus NPSH is coupled 
to trade wind and westerlies velocities in these 
two sectors reveals another linkage to high- 
northern-latitude ice volume via the Atlantic 
meridional overturning circulation (AMOC) 
(34-38). AMOC strongly affects the oceanic trans- 
port of heat to the high North Atlantic. AMOC in 
turn is strongly influenced by the Agulhas cur- 
rent leakage of high-salinity water masses from 
the southern Indian Ocean into the Atlantic. 
Modulating this source of high-density surface 
waters profoundly influences North Atlantic 
Deep Water production—a critical element of 
AMOC. But Agulhas current leakage is coupled 
to the strength and position of the SISH (the 
Mascarene High). 

Modern observations between 1993 and 2009 
(34) demonstrate that intensification of the SISH 
was indeed accompanied by both southern west- 
erlies and trade wind intensification. The resulting 
increased wind stress coupling to the sea sur- 
face has led to enhanced South Equatorial Cur- 
rent and intensification of the Agulhas western 
boundary current system in the southwest Indian 
Ocean (fig. S5). These observations show that 
both mean kinetic energy and eddy current en- 
ergy in the Agulhas system increased over this 
period. Of particular interest is the observation 
that both westward and southward propagation 
of mesoscale eddies in the southern Agulhas (32 
to 36°S) also increased over this period. West- 
ward propagation of these large eddies into the 
South Atlantic is the chief mechanism by which 
the Agulhas leakage occurs. Thus, modern evidence
also shows that modulating southern
trade wind velocity over the Indian Ocean
through changing SISH strength could strongly
affect AMOC and the transport of heat to the
high North Atlantic region.

Fig. 4. Intertropical seesaw: Orbitally induced changes in solar insolation mimic the seasonal
interhemispheric heating-cooling cycle in the tropics and subtropics. (A) During boreal
summer insolation maxima, Tibetan high outflow to the SISH is increased in response to two factors:
(i) colder SST in the South Indian Ocean, which enhances the surface anticyclone there, and
(ii) earlier seasonal transition of the upper-tropospheric subtropical zonal westerly jet (24, 25)
to the north side of the TP, strengthening the Tibetan high. Both factors enhance Hadley circulation
and result in strengthening trade winds and Somali jet in the Indian Ocean, strengthening the ISM.
At the same time, outflow to the NPSH is reduced as a consequence of warmer North Pacific SST,
producing weaker northern trade winds and a smaller onshore pressure gradient in the North Pacific,
weakening the WNPSM. (B) During boreal summer insolation minima, the opposite occurs.

Counteracting effects might also occur, how- 
ever, because southern trade wind intensification 
can also lead to increased Agulhas mean velocity, 
resulting in Agulhas overshoot in the region south 
of South Africa. In this case, increased Agulhas 
momentum could carry the current past the 
zero-wind stress curl latitude into the region 
dominated by the westerlies, reducing Agulhas 
leakage and increasing retroflection back into 
the south Indian Ocean. Instabilities in either one 
of these two phenomena could produce Dansgaard- 
Oeschger cycles in the North Atlantic through 
modulation of AMOC, explaining the close cor- 
respondence between Greenland ice core and 
Chinese cave records on orbital or even suborbital 
time scales (1-4). There is evidence for strong
increases in Agulhas leakage prefacing each of 
the past five glacial terminations (38), on the 
basis of the sudden increases of Indian Ocean 
phytoplankton species in South Atlantic cores 
that were either concurrent with or occurred 
just before the ice age terminations. 


These observations illustrate how the cave 
isotope and ¹⁰Be rainfall proxy records can be
reconciled and show that there are mechanisms 
by which the Asian monsoon can directly influ- 
ence high-northern-hemisphere climate, reopen- 
ing the fundamental paleoclimate question of 
whether, or to what degree, glacial and inter- 
glacial climate is being forced from high versus 
low latitudes. 
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Evolution of pallium, hippocampus, 
and cortical cell types revealed by 
single-cell transcriptomics in reptiles 


Maria Antonietta Tosches,*† Tracy M. Yamawaki,*‡ Robert K. Naumann,§
Ariel A. Jacobi,|| Georgi Tushev, Gilles Laurent†


Computations in the mammalian cortex are carried out by glutamatergic and γ-aminobutyric
acid-releasing (GABAergic) neurons forming specialized circuits and areas. Here we 
asked how these neurons and areas evolved in amniotes. We built a gene expression 
atlas of the pallium of two reptilian species using large-scale single-cell messenger RNA 
sequencing. The transcriptomic signature of glutamatergic neurons in reptilian cortex 
suggests that mammalian neocortical layers are made of new cell types generated by 
diversification of ancestral gene-regulatory programs. By contrast, the diversity of reptilian 
cortical GABAergic neurons indicates that the interneuron classes known in mammals 
already existed in the common ancestor of all amniotes. 


Amniote vertebrates (mammals, reptiles,
and birds) originated from a common an- 
cestor about 320 million years ago. In all 
developing amniotes, the dorsal telence- 
phalon, or pallium, is patterned by the same 
signaling molecules and subdivided into homologous
embryonic regions (1). In adult amniote
brains, however, the structures that arise from 
these homologous pallial regions have different 
morphologies and connectivity. For example, 
a six-layered neocortex exists only in mammals, 
and the dorsal ventricular ridge (DVR) is found 
only in birds and reptiles. The hippocampus may 
be the most conserved pallial region (2), but even 
there, it is uncertain whether all the subfields 
known in mammals (for example, the dentate 
gyrus) exist in nonmammals (3) (fig. S1). 
Gene expression data can help reconstruct the 
evolution of brain regions and cell types (4, 5). 
Here we used single-cell transcriptomics to study 
the evolution of neuronal diversity in the am- 
niote telencephalon. Because cell types defined 
through transcriptomics match those defined by 
morphology, physiology, and connectivity (6-8), 
single-cell mRNA sequencing can be applied 
both for cell-type discovery and for cross-species 
comparisons (9). We focused on reptiles, because 
they, unlike birds, have a layered cortex, and asked 
the following: (i) Can we identify molecular sim- 
ilarities and homologies between reptilian and 
mammalian pallial regions? (ii) Are there rep- 
tilian cortical neurons homologous to mamma- 
lian hippocampal neurons? (iii) Can we link the 


Max Planck Institute for Brain Research, Max-von-Laue 
Strasse 4, 60438 Frankfurt am Main, Germany. 

*These authors contributed equally to this work. 

†Corresponding author. Email: gilles.laurent@brain.mpg.de
(G.L.); maria.tosches@brain.mpg.de (M.A.T.) ‡Present address:
Amgen, Inc., 1120 Veterans Blvd., South San Francisco, CA 94080,
USA. §Present address: Shenzhen Institutes of Advanced Technology,
Chinese Academy of Sciences, 1068 Xueyuan Ave.,
Shenzhen University Town, Nanshan District, 518055 Shenzhen,
China. ||Present address: University of California, Davis, School of
Medicine, 4610 X St., Sacramento, CA 95817, USA.


Tosches et al., Science 360, 881-888 (2018)


reptilian three-layered cortex to the mamma- 
lian six-layered neocortex? (iv) Are mammalian 
γ-aminobutyric acid-releasing (GABAergic) interneuron
types also found in the reptilian cortex?


Neuronal and glial cells in the
reptilian pallium

Using Drop-seq (8), we obtained a comprehen- 
sive, unbiased classification of adult cell types 
sampled from the pallium of a turtle and a lizard 
species (dissections and sequencing statistics 
in figs. S1 and S2 and tables S1 and S2). After 
quality filtering (fig. S3), we used unsupervised 
graph-based clustering of the transcripts from 
18,828 turtle and 4187 lizard cells (with a median 
of 2731.5 and 1918 transcripts per cell, respec- 
tively) and identified first-level clusters of neu- 
ronal and non-neuronal cells (Fig. 1, A to C). 
Among non-neuronal cells, we found clusters 
expressing prototypical markers of mammalian 
glial cells: mature oligodendrocytes and their 
precursors, ependymoglial cells, and microglia 
(Fig. 1, B to D). Ependymoglial cells coexpressed 
markers of mammalian astrocytes (GFAP), adult 
stem cells (SOX9), and ependymal cells (FOXJ1), 
suggesting a shared evolutionary origin of these 
cell types (fig. S4, A to C). Reptilian neuronal 
clusters included vesicular glutamate transporter 
type 1 (VGLUT1⁺) glutamatergic excitatory neurons,
glutamate decarboxylase 1 and 2 (GAD1⁺
GAD2⁺) GABAergic inhibitory interneurons, and
neural progenitor cells (Fig. 1, A to D), con- 
sistent with the existence of adult neurogenesis 
in reptiles. To compare the transcriptomes of 
turtle, lizard, and mouse neuronal and non- 
neuronal cells (6), we selected one-to-one ortho- 
logs differentially expressed among these cell 
types, defined a specificity score for each gene 
in each cluster, and computed pairwise rank cor- 
relations (see methods). This analysis supports 
correspondence between major neuronal and 
non-neuronal cell classes (fig. S4, D to F) across 
turtle, lizard, and mouse. 
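
The cross-species comparison described above (one-to-one orthologs, a per-cluster specificity score, pairwise rank correlations) can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the specificity score is assumed here to be a gene's mean expression in a cluster divided by its mean across all clusters (the actual definition is given in the methods), and the marker set and expression values are invented toy data.

```python
# Toy sketch: specificity scores and Spearman rank correlation between clusters
# of two species. Score definition and all numbers are illustrative assumptions.

def specificity(cluster_means):
    """cluster_means: {cluster: {gene: mean expression}}.
    Returns the same structure, each gene scaled by its average across clusters."""
    clusters = list(cluster_means)
    genes = list(cluster_means[clusters[0]])
    scores = {c: {} for c in clusters}
    for g in genes:
        avg = sum(cluster_means[c][g] for c in clusters) / len(clusters)
        for c in clusters:
            scores[c][g] = cluster_means[c][g] / avg if avg > 0 else 0.0
    return scores

def ranks(values):
    """1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = mean_rank
        i = j + 1
    return out

def spearman(x, y):
    """Spearman correlation = Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5

def cross_species(scores_a, scores_b, orthologs):
    """Correlate every cluster of species A with every cluster of species B."""
    return {(ca, cb): spearman([scores_a[ca][g] for g in orthologs],
                               [scores_b[cb][g] for g in orthologs])
            for ca in scores_a for cb in scores_b}

# Hypothetical mean expression per cluster for three shared orthologs.
turtle = {"neuron": {"SNAP25": 9.0, "GFAP": 0.5, "MBP": 0.2},
          "glia":   {"SNAP25": 0.5, "GFAP": 8.0, "MBP": 3.0}}
mouse = {"neuron": {"SNAP25": 7.0, "GFAP": 0.3, "MBP": 0.4},
         "glia":   {"SNAP25": 0.4, "GFAP": 6.0, "MBP": 5.0}}
genes = ["SNAP25", "GFAP", "MBP"]

corr = cross_species(specificity(turtle), specificity(mouse), genes)
best = {ca: max(mouse, key=lambda cb: corr[(ca, cb)]) for ca in turtle}
```

With these toy values each turtle class correlates best with the mouse class of the same name. Real data would involve thousands of orthologs and a significance assessment.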
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Subclustering of the turtle and lizard neurons 
revealed finer distinctions between and within 
excitatory glutamatergic and inhibitory GABAergic 
neuron types (Fig. 1, E to G, and fig. S5). From 
these neuronal data sets, we selected differentially 
expressed genes as markers for histological vali- 
dation and for unbiased comparisons with mam- 
malian cell types. We focused on the turtle data 
and used the lizard data for corroboration. 


Spatial segregation of glutamatergic 
neurons: A molecular map of the 
reptilian pallium 

Our dissections encompassed multiple regions 
of the reptilian pallium likely to contain hetero- 
geneous populations of glutamatergic neurons. 
To establish the anatomical location of our turtle 
glutamatergic-cell clusters, we selected highly 
variable genes in the data set that were ex- 
pressed in some clusters but not detected in 
others. These markers were expressed in distinct 
regions of the pallium (fig. S6). The combinato- 
rial expression of markers defined “superclus- 
ters,” seen also as groupings of adjacent clusters 
in t-distributed stochastic neighbor embedding 
(tSNE) plots (Fig. 2, A to B, and fig. S6A; compare 
Fig. 1E and Fig. 2B), in agreement with higher- 
level clustering analysis and with our tissue dis- 
sections (figs. S7 and S8D and tables S1 and S2). 
These superclusters correspond to defined ana- 
tomical regions: the medial and dorsomedial 
cortices, the anterior and posterior dorsal cortex, 
the pallial thickening, the anterior and poste- 
rior lateral cortex, and the anterior and poste- 
rior DVR (fig. S8, A to C). Cell-type similarity was 
generally higher within than between superclus- 
ters (fig. S8, F and G). Weighted gene correlation 
network analysis (WGCNA, see methods) indi- 
cates that the unique genetic signature of each 
region results from different combinations of 
gene modules (Fig. 2C). We also associated gluta- 
matergic clusters to anatomically defined pallial 
regions in lizard (fig. S9). Corresponding regions 
in lizard and turtle have different relative sizes 
(for example, lizard anterior dorsal cortex is small) 
but are clearly delineated by the expression of the 
same developmental transcription factors such as 
ZBTB20, SATB1, DACH2, and ETV1 (ER81) (Fig.
2D). These data define the molecular regionaliza- 
tion of the adult reptilian pallium. 
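
The WGCNA module eigengenes in Fig. 2C are computed on pseudocells, that is, averages of 4 to 5 cells. A minimal sketch of that averaging step is given here, assuming cells are simply taken in order within each cluster; the grouping rule actually used is described in the methods, and the function name and data are hypothetical.

```python
def make_pseudocells(cells, labels, size=5, min_size=4):
    """Average expression vectors of consecutive cells sharing a cluster label.

    cells:  list of equal-length expression vectors (lists of floats)
    labels: cluster label for each cell
    Groups smaller than min_size are dropped (an assumption of this sketch).
    """
    by_cluster = {}
    for vec, lab in zip(cells, labels):
        by_cluster.setdefault(lab, []).append(vec)

    pseudo = []
    for lab, vecs in by_cluster.items():
        for start in range(0, len(vecs), size):
            chunk = vecs[start:start + size]
            if len(chunk) < min_size:
                continue
            mean = [sum(col) / len(chunk) for col in zip(*chunk)]
            pseudo.append((lab, mean))
    return pseudo

# Nine toy cells in cluster e07 and four in cluster e13, two genes each.
cells = [[float(i), 1.0] for i in range(9)] + [[10.0, 0.0]] * 4
labels = ["e07"] * 9 + ["e13"] * 4
pseudocells = make_pseudocells(cells, labels)
```

Averaging a few cells reduces the sparsity of single-cell counts before gene-gene correlations are computed, at the cost of some resolution within clusters.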

Putative homologies between reptilian and 
mammalian pallial derivatives are disputed 
(1, 2, 10, 11). Central to this debate is the anterior
DVR, one of the derivatives of the ventral
pallium, which is enlarged in birds and reptiles. 
This region has been proposed as a homolog 
of either ventral pallium derivatives [claus- 
trum, endopiriform nucleus, and pallial amyg- 
dala (1, 2, 12)] or parts of the neocortex in 
mammals (5, 10, 11). The latter hypothesis suggests
a dual evolutionary origin of the neocortex,
either as separate regions, where medial and
lateral neocortex are homologous to reptilian
dorsal cortex and DVR, respectively (11), or as
intermixed cell types, where separate neocortical
layers are homologous to reptilian dorsal cortex or
DVR (5, 10). We compared the turtle superclusters
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to mammalian pallial derivatives, using a human 
microarray data set as a reference [(13); see analysis
in methods].

Our analysis using all differentially expressed 
genes (Fig. 2E) reveals similarities between tur- 
tle medial and dorsomedial cortices and human 
hippocampus, supporting previous hypotheses 
(3). The posterior dorsal cortex also showed posi- 
tive correlations to human hippocampus and 
subiculum but low negative correlations with 
nonhippocampal cortices, consistent with the 
gene network analysis (Fig. 2C). Earlier studies 
recognized anatomical similarities between parts 
of the reptilian dorsal cortex and mammalian 
subiculum (2, 14), where ETV1 (ER81) is expressed
(15). It may thus be that the posterior dorsal 
cortex relates to mammalian peri-hippocampal 
regions. 

Turtle pallial thickening and mammalian claus- 
trum shared expression of several claustrum- 
enriched markers (fig. S6B), consistent with 
anatomical and developmental data (1, 2). We 
also found correlations between reptilian lateral 
cortex and mammalian piriform cortex and 
between posterior DVR and pallium-derived 
amygdalar nuclei, with the exception of the lat- 
eral amygdala. Individual posterior DVR clusters 
expressed markers of mammalian basolateral 
(ETV1 and FEZF2) and cortical (ZIC2 and TBR1)
amygdala. These clusters mapped to distinct 
nuclei of the turtle posterior DVR, suggesting 
that these pallial amygdala subdivisions existed 
in the common ancestor of mammals and rep- 
tiles (fig. S10) (16). 

Mammalian neocortex showed correlations 
with several turtle pallial regions (Fig. 2E). Rea- 
soning that correlations based on all differentially 
expressed genes might not separate homology 


Fig. 1. Reptilian single-cell data sets. 

(A) tSNE representation of 18,828 turtle 
(Trachemys scripta elegans, ts) single-cell 
transcriptomes, with cells color coded by cluster. 
(B) Violin plots showing expression of neuronal 
and non-neuronal markers in turtle clusters. 

(C) tSNE representation of 4187 lizard (Pogona 
vitticeps, pv) single-cell transcriptomes, color 
coded by cluster. (D) Violin plots showing 
expression of neuronal and non-neuronal markers 
in lizard clusters. In (B) and (D), for each gene, 
violin plots are scaled to the maximum number 
of transcripts (unique molecule identifiers) 
detected for that gene (max). Exc, glutamatergic 
excitatory neurons; Inh, GABAergic inhibitory 
interneurons; NPC, neural progenitor cells; Olig, 
mature oligodendrocytes; OPC, oligodendrocyte 
precursors; EG, ependymoglial cells; MG, microglia; 
Leu, leucocytes; Mur, mural cells; Vend, vascular 
endothelial cells. Subclusters of Exc and Inh cells in 
(A) and (C) are not matched by name or color. 
(E) tSNE plot and clusters of 5901 high-quality 
turtle neurons. Inset is the tSNE plot in (A), 
indicating neuronal and glial clusters in dark red 
and gray, respectively. (F and G) Expression of 
the glutamatergic marker VGLUT1 (F) and 
GABAergic marker GAD1 (G) in turtle neurons. 
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from convergent evolution (because functional 
convergence could recruit the same effector 
genes under different transcription factors), we 
next restricted our analysis to transcription fac- 
tors. Under these conditions, only anterior dorsal 
cortex correlated with human neocortex (Fig. 
2F). Anterior dorsal cortex and anterior DVR 
differed by the expression of those transcription 
factors that, in mammals, are also found in either 
the neocortex or pallial amygdala (for example, 
NFIX, BCL11A, and SATB2 in turtle anterior dorsal
cortex and mouse neocortex; NR2F2 and DACH2 
in turtle anterior DVR and mouse amygdala) (fig. 
S8, H and I). Of the mammalian pallial amyg- 
dala subdivisions, only the lateral amygdala corre- 
lates with the anterior DVR. Our results suggest 
that (i) different combinations of transcription 
factors may regulate the expression of “neo- 
cortical” effector genes in anterior dorsal cortex 
and anterior DVR and (ii) transcription-factor 
expression reflects the developmental (and evo- 
lutionary) history of pallial neurons. We propose 
that reptilian anterior dorsal cortex and mam- 
malian neocortex are homologous as dorsal 
pallium derivatives and that reptilian DVR and 
mammalian pallial amygdala are homologous 
as ventral pallium derivatives [see also (1)]. In
reptiles, the expansion of the sensory-recipient 
anterior DVR led to the emergence of neuronal 
types specialized in processing different sensory 
modalities, recognizable as separate molecular, 
anatomical, and functional clusters (figs. S9 and 
S10) (17). According to this hypothesis, reptilian
anterior DVR and mammalian neocortex ac- 
quired, by convergent evolution, the expression 
of similar sets of effector genes. 
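
The reasoning above contrasts correlations computed from all differentially expressed genes with correlations restricted to transcription factors. The toy sketch below shows how such a restriction can change the result. The transcription factor names come from the text, but the expression values and the placeholder effector genes (EFF1 to EFF4) are invented, and Pearson correlation is used only for brevity.

```python
def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def region_correlation(region_a, region_b, genes):
    """Correlate two regions over a chosen gene set."""
    return pearson([region_a[g] for g in genes], [region_b[g] for g in genes])

tfs = ["SATB2", "NFIX", "NR2F2", "DACH2"]
effectors = ["EFF1", "EFF2", "EFF3", "EFF4"]  # hypothetical effector genes
all_genes = tfs + effectors

# Hypothetical region-level expression in arbitrary units.
neocortex = {"SATB2": 5, "NFIX": 4, "NR2F2": 0, "DACH2": 1,
             "EFF1": 6, "EFF2": 1, "EFF3": 5, "EFF4": 0}
adc = {"SATB2": 4, "NFIX": 5, "NR2F2": 1, "DACH2": 0,
       "EFF1": 5, "EFF2": 0, "EFF3": 6, "EFF4": 1}
advr = {"SATB2": 0, "NFIX": 1, "NR2F2": 5, "DACH2": 4,
        "EFF1": 6, "EFF2": 1, "EFF3": 4, "EFF4": 0}

advr_all = region_correlation(neocortex, advr, all_genes)  # shared effectors dominate
advr_tf = region_correlation(neocortex, advr, tfs)         # transcription factors only
adc_tf = region_correlation(neocortex, adc, tfs)
```

In this constructed example the anterior DVR correlates positively with neocortex when effector genes are included but negatively when only transcription factors are considered, whereas the anterior dorsal cortex remains positively correlated.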


Conservation of hippocampal neuronal 
types and areal organization 


Anatomical and developmental evidence suggests
that the reptilian medial-most cortex is homol- 
ogous to mammalian hippocampus (3, 18). In 
line with this, turtle and lizard medial cortices 
express the mammalian pan-hippocampal tran- 
scription factor ZBTB20 (Fig. 3A and fig. S11). 
Whether individual hippocampal subfields [den- 
tate gyrus (DG), cornu ammonis (CA)3, CA2, and 
CA1] are present in reptiles and birds is less clear.
Developmental evidence suggests so (18), but some 
describe mammalian DG, with its mossy cells 
and granule cells, as a mammalian novelty (3). 
Reptilian ZBTB20-positive clusters could be 
further distinguished by the expression of mam- 
malian DG or CA transcription factors: In turtle, 
PROX1 and MEF2C (specifying mouse DG granule
cells) labeled the medial cortex, and ETV1, MEIS2, 
and LMO4 (CA) labeled the dorsomedial cortex 
(Fig. 3A). The expression of these genes in ad- 
jacent domains of turtle and lizard medial cor- 
tices (fig. S11) suggests the existence of DG- and 
CA-like neuronal types in reptiles. This was fur- 
ther supported by unbiased analyses of cell-type 
transcriptomes. Using WGCNA to identify and 
compare gene modules [mouse data from (19), 
see methods], we found statistically significant 
overlaps between mouse DG and turtle medial 
cortex modules and between mouse CA and 
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turtle dorsomedial cortex modules (fig. S12). 
Mouse DG and turtle medial cortex modules 
shared genes coding for K⁺-channel subunits
or associated proteins (KCNG1, KCNA1, and
KCNIP4), possibly accounting for electrophysiological
similarities (20). Other shared genes included
the cadherin CDH8, involved in the formation of 
DG-CA3 synapses, and the granule-cell specific 
regulators of synaptogenesis and AMPA receptor- 
mediated synaptic transmission LRRTM4 and 
CNIH3 (fig. S12). Hence, DG granule cells, in- 
cluding aspects of their membrane and synaptic 
physiology, are conserved across mammals and 
reptiles (20). [No mossy-cell marker (21) had cell-type
specific expression in turtle medial cortex.]
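
One common way to judge whether two gene modules share more genes than expected by chance is a hypergeometric test. The sketch below assumes that test and uses invented module sizes; the statistical procedure actually applied to the mouse and turtle modules is described in the methods.

```python
from math import comb

def overlap_pvalue(n_background, n_module_a, n_module_b, n_shared):
    """Upper-tail hypergeometric probability.

    Probability of observing at least n_shared genes in common when
    n_module_b genes are drawn at random from n_background genes,
    n_module_a of which belong to module A.
    """
    upper = min(n_module_a, n_module_b)
    total = comb(n_background, n_module_b)
    tail = sum(
        comb(n_module_a, k) * comb(n_background - n_module_a, n_module_b - k)
        for k in range(n_shared, upper + 1)
    )
    return tail / total

# Hypothetical example: 2000 shared orthologs, modules of 100 and 80 genes,
# 20 genes in common (about 4 would be expected by chance).
p = overlap_pvalue(n_background=2000, n_module_a=100, n_module_b=80, n_shared=20)
```

The background here is the set of one-to-one orthologs tested in both species. Choosing a different background changes the p-value, so it should be stated alongside any reported overlap.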

Next, we computed pairwise cluster correlations
using one-to-one orthologs differentially expressed
among the ZBTB20⁺ clusters. This revealed
further heterogeneity among the ZBTB20⁺ ETV1⁺
cells, with turtle dorsomedial cortex clusters showing
best mutual correlations to mouse CA3 or CA1
(Fig. 3B). Mammalian CA3 or CA1 markers (19, 21)
were differentially expressed in these turtle clus- 
ters (Fig. 3C). CA3- and CA1-like cells occupy ad- 
jacent positions [with CA3-like cells closest to 
medial cortex (DG)] and form morphologically 
distinct cell plates (Fig. 3A and fig. S11). We 
found no evidence for a reptilian CA2 cell type 
(Fig. 3B and fig. S11B). 

In summary, reptilian hippocampus can be 
subdivided into adjacent territories similar to 
the mammalian DG, CA3, and CA1 fields. The
developmental origin of these cells from the 
medial pallium (18) and their similar mediolateral
distribution, connections (22), biophysical
properties (20), and molecular identities (this 
study) support the hypothesis that mammalian- 
like hippocampal regions were already present 
as adjacent fields in the ancestor of all amniotes. 
The architecture of the mammalian hippocam- 
pus, with its signature infoldings, may thus result 
from the considerable enlargement of neocortex 
in the mammalian lineage. 


Transcriptomic signatures of 
neocortical upper and deep layers 
in turtle dorsal cortex 


Mammalian six-layered neocortex evolved from 
a presumed three-layered ancestral cortex in a 
stem amniote. What is the relationship between 
the neurons and layers in the mammalian neo- 
cortex and the reptilian cortex? Earlier studies 
suggested that L2/3 and L4 (“upper layer,” UL) 
neurons are mammalian novelties; reptilian 
pyramidal neurons have thus been compared 
to the mammalian early born L5 and L6 (“deep 
layer,” DL) neurons (23, 24). By contrast, connectivity
implies that reptilian cortex should
harbor both L4 (that is, thalamorecipient) and 
L5 (cortico-fugal) types, and it has been suggested 
that these types, rather than occupying different 
layers, populate adjacent fields in turtle cortex: 
one in anterior dorsal cortex (RORB⁺ L4-like neurons)
and one in posterior dorsal cortex (ETV1⁺
L5-like neurons) (5, 10).

Our data indicate that anterior dorsal cortex 
is the only region comparable to mammalian neo- 
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cortex (Fig. 2). We examined the expression of 
prototypical mammalian-layer markers (7, 25, 26) 
in the six turtle anterior dorsal cortex gluta- 
matergic clusters (e07, e08, and e13 to e16). These 
cells expressed several mammalian UL and DL 
markers, but these genes were often coexpressed 
in the same clusters (Fig. 4A and fig. S13A). When 
we focused on the transcription factors that es- 
tablish and maintain cell identity in the neo- 
cortex, we observed that, in the turtle, these genes 
were expressed in combinations that were never 
observed in differentiated mammalian cortical 
neurons. For example, all turtle anterior dorsal 
cortex cell types coexpress genes enriched in 
mammalian L2/3, L4, and L5a intratelencephalic
neurons, including SATB2, RORB, and RFX3, as 
well as genes specifying L5b and L6 corticofugal 
projection neurons, such as BCL11B (CTIP2), TBR1,
and SOX5 (all clusters except e13) (25) (Fig. 4,
A and B). In mouse neocortex, some of these 
genes are known to repress each other’s expression
in postmitotic cells (for example, Satb2 and
Bcl11b; Tbr1 and Bcl11b) (26).
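
Coexpression within a cluster can be quantified as the fraction of cells in which transcripts of both genes are detected. The following is a minimal sketch with invented counts; the detection threshold of one transcript and the example cells are assumptions, not values from the study.

```python
def coexpression_fraction(cells, gene_a, gene_b, min_count=1):
    """Fraction of cells in which both genes reach min_count transcripts.

    cells: list of {gene: transcript count} dictionaries, one per cell.
    """
    if not cells:
        return 0.0
    both = sum(
        1 for c in cells
        if c.get(gene_a, 0) >= min_count and c.get(gene_b, 0) >= min_count
    )
    return both / len(cells)

# Four hypothetical cells from one cluster.
cluster = [
    {"SATB2": 3, "BCL11B": 2},
    {"SATB2": 1, "BCL11B": 0},
    {"SATB2": 2, "BCL11B": 5},
    {"SATB2": 0, "BCL11B": 1},
]
frac = coexpression_fraction(cluster, "SATB2", "BCL11B")
```

Because droplet-based sequencing captures only a fraction of each cell's transcripts, a gene that goes undetected in a cell may still be expressed there, so this fraction is best read as a lower bound on true coexpression.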

Extending the comparative analysis to all dif- 
ferentially expressed genes revealed that anterior 
dorsal cortex cell types correlated with either 
mammalian UL (e13 to e16) or DL (e07 and e08) 
neurons, independent of the neocortical area 
used for comparison (Fig. 4C and fig. S14, A to 
C). This trend was confirmed by gene network 
analysis (fig. S13, B and C). By contrast, anterior 
DVR clusters could not be grouped into UL- and 
DL-like classes (fig. S14, D to F). 

As anticipated by the sequencing data, in situ 
hybridizations (ISHs) showed coexpression of 
mammalian UL and DL transcription factors 
throughout the turtle anterior dorsal cortex 
(Fig. 4B). Individual UL-like neuronal types (e13 
to e16) were differentially distributed along the 
mediolateral and rostrocaudal axes of the anterior 
dorsal cortex, matching known anatomical sub- 
divisions (fig. S15). By contrast, the DL-like cells 
e07 and e08, identified by parathyroid hormone- 
like hormone (PTHLH) expression, appeared in- 
terspersed throughout the anterior dorsal cortex. 
In the rostro-lateral dorsal cortex, DL-like cells 
were confined to the superficial part of L2 (Fig. 
4D; additional markers in fig. S15, C to E). These 
markers thus identify two distinct sublayers in 
turtle L2: a superficial L2a with mostly DL-like 
neurons and a deeper L2b with mostly UL-like 
neurons. Retrograde tracing from the thalamus 
labels L2a cells (27, 28), suggesting that these 
neurons, or a subset of them, are corticofugal 
and project to the thalamus. 

In conclusion, our transcriptome-wide com- 
parisons reveal the presence of global UL- and 
DL-like genetic signatures in distinct neuronal 
types of turtle anterior dorsal cortex but do not 
support, with the current resolution, homologies 
between turtle glutamatergic types and indi- 
vidual neocortical layers. In reptiles and mam- 
mals, UL and DL genetic signatures might have 
evolved independently from a stem amniote that 
lacked distinct UL- and DL-like neurons: Neu- 
rons with similar characteristics, such as input- 
output connectivity, would have acquired the 
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expression of similar gene sets by convergent 
evolution. Alternatively, UL- and DL-like neurons 
may have existed already in the dorsal cortex of 
stem amniotes. If true, the emergence of the six 
layers that form mammalian neocortex would 
be a novelty [sensu (29)], with the evolution of 
new pyramidal cell types through extensive mod- 
ifications of the genetic programs specifying an- 
cestral UL- and DL-like types. 


Conservation of GABAergic interneuron 
classes across amniotes 


GABAergic interneurons in mammalian neo- 
cortex are diverse and participate in different 
circuit motifs and computations (30). Little is 
known about cortical interneurons outside of 
mammals. In several vertebrates, including reptiles,
GABAergic interneurons are generated in
conserved subpallial regions (the medial, caudal,
and lateral ganglionic eminences: MGE, CGE,
and LGE) and migrate to the pallium (31, 32).
Reptiles, however, are thought to lack some mam- 
malian GABAergic types, such as cortical vaso- 
active intestinal peptide (VIP) interneurons (24). 
We examined the turtle GABAergic clusters (i01 
to i18) and the expression of transcription factors 
known to define mammalian GABAergic types. 


Fig. 2. Reptilian pallial regions. (A) Violin 
plots showing expression of genes discrimi- 
nating between spatially segregated glutama- 
tergic neurons in the turtle pallium. (B) tSNE 
showing turtle glutamatergic neurons colored 
by supercluster. Transcription factor names are 
in red. (C) Heatmap showing expression of
module eigengenes calculated from turtle 
glutamatergic neurons. Pseudocells (averages 
of 4 to 5 cells used for WGCNA, see methods) 
are shown in columns, ordered by cluster and 
supercluster. (D) Expression of transcription 
factors that label corresponding pallial 

regions in turtle and lizard (ISH, frontal sections 
at different anterior-posterior levels; see 

also figs. S8 and S9). Scale bars, 1 mm. 

(E and F) Pairwise correlations of turtle gluta- 
matergic superclusters and human pallial 
regions, calculated from all genes (E) or 
transcription factors (F) differentially expressed 
in turtle or human. In mammals, only some 
parts of the cingulate gyrus (CgG), long insular 
gyrus (LIG), and short insular gyrus (SIG) are 
classified as neocortex [see (13) for human 
data]. Dots indicate statistically significant cor- 
relations. MC, medial cortex; DMC, dorsomedial 
cortex; pDC and aDC, posterior and anterior 
dorsal cortex; PT, pallial thickening; aLC and pLC, 
anterior and posterior lateral cortex; aDVR and 
pDVR, anterior and posterior dorsal ventricular 
ridge; DG, dentate gyrus; CA, cornu ammonis; 
Sub, subiculum; PHG, parahippocampal gyrus; 
OL, occipital lobe; TL, temporal lobe; PL, parietal 
lobe; FL, frontal lobe; Cl, claustrum; Pir, piriform 
cortex; LA, lateral amygdala; ATZ, amygdalohip- 
pocampal transition zone; BLA, basolateral 
amygdala; BMA, basomedial amygdala; COMA, 
corticomedial amygdala. 
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From the combinatorial expression of these genes, 
we identified putative MGE-derived (i07 to i13:
LHX6⁺, SOX6⁺, and SATB1⁺), CGE-derived (i14 to
i18: NPAS1⁺, SP8⁺, NR2E1⁺, and PROX1⁺), and
LGE-derived (i01 and i04 to i06: MEIS2⁺ ZIC1⁺ and
in subsets TSHZ1⁺ SIX3⁺ and/or PBX3⁺) clusters
(Fig. 5A). Corresponding MGE-, CGE-, and LGE- 
derived neurons could also be identified in liz- 
ard (fig. S16, A to C). The remaining GABAergic 
clusters correspond to cells dissected from the 
neighboring septum (i02 and i03) and striatum 
(04). Stainings revealed the presence of LGE- 
and septum-derived interneurons in the amyg- 
dala (i04 and i05) and olfactory bulb (i01 and 
i02), indicating that LGE- and septum-derived 
GABAergic neurons have similar molecular identities
and migratory trajectories in reptiles and
mammals (Fig. 5A and fig. S16, D and E). 
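
Assigning clusters to a developmental origin from combinations of marker genes amounts to a rule table. The sketch below encodes the MGE, CGE, and LGE marker combinations named in this paragraph. The rule that every marker in a set must be detected, the gene symbols as read from the text, and the example clusters are simplifying assumptions for illustration.

```python
# Marker combinations per putative origin, following the text of this paragraph.
ORIGIN_MARKERS = {
    "MGE": {"LHX6", "SOX6", "SATB1"},
    "CGE": {"NPAS1", "SP8", "NR2E1", "PROX1"},
    "LGE": {"MEIS2", "ZIC1"},
}

def classify_origin(expressed_genes):
    """Return the single origin whose full marker set is detected in a cluster.

    Clusters that match no marker set, or more than one, are left unassigned.
    """
    expressed = set(expressed_genes)
    hits = [origin for origin, markers in ORIGIN_MARKERS.items()
            if markers <= expressed]
    return hits[0] if len(hits) == 1 else "unassigned"

# Hypothetical clusters described by the genes detected in them.
examples = {
    "cluster_a": {"LHX6", "SOX6", "SATB1", "SST"},
    "cluster_b": {"MEIS2", "ZIC1", "TSHZ1"},
    "cluster_c": {"GAD1"},
}
assigned = {name: classify_origin(genes) for name, genes in examples.items()}
```

A strict all-markers rule is brittle on sparse single-cell data. A scoring scheme that tolerates missing markers would be a natural refinement.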

Turtle MGE- and CGE-derived interneurons 
could be further subdivided into somatostatin 
(“SST”), parvalbumin-like (“PV-like”), “HTR3A 
Reln,” and “HTR3A VIP-like” classes, match- 
ing the classification of mammalian cortical 
GABAergic interneurons (7, 30) (Fig. 5A). Pair- 
wise cluster correlations (Fig. 5B) and gene net- 
work analysis (fig. S17) further supported the 
conclusion that PV-like, SST, and HTR3A inter- 
neuron classes are conserved in reptiles and 
mammals. [Note that transcriptomics identifies 
VIP- and PV-like interneurons even though VIP 
and PVALB are not reliably expressed in these 
cells (Fig. 5A).] 
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Fig. 3. Molecular architecture of the reptilian hippocampus. (A) 
Expression of hippocampal markers in turtle medial (MC) and dorsomedial 
(DMC) cortices. Arrowheads indicate boundaries between DG, CA3, and 
CA1 (compare to schematic at right). D, dorsal; V, ventral; M, medial; L, 
lateral. (B) Pairwise correlations of turtle and lizard (top) and turtle and 
mouse (bottom) hippocampal clusters. Dots indicate statistically signifi- 
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In turtle and lizard, markers of MGE- and CGE- 
derived interneurons were expressed in cells scat- 
tered throughout the pallium, including the dorsal 
cortex (Fig. 5C). As in mammals, where MGE- 
and CGE-derived interneurons have different 
densities across cortical layers, neurons express- 
ing ADARB2 (HTR3A) and SST (SST) were found 
preferentially in superficial and deep dorsal 
cortex, respectively; NDNF (HTR3A Reln) was 
expressed in rare dorsal cortex subpial cells, 
reminiscent of mammalian L1 neurogliaform 
cells (Fig. 5, D and E). Because the same classes 
of cortical GABAergic neurons exist in mammals 
and reptiles, they likely existed in their amniote 
ancestor. Our analysis did not detect similarities 
at a finer level: For example, we found no turtle 
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cant correlations. In lizard, the ZBTB20* ETV1* medial cortices are 
termed DMC and D1 in the classical literature but correspond to turtle 
DMC. (C) On the left, expression of mammalian hippocampal markers in 
turtle single cells (columns, arranged by cluster) and, on the right, 
expression of the same markers in mouse [data from (19)]. Transcription 
factor names are in red. Scale bars, 200 μm.
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Fig. 4. UL- and DL-like neurons in turtle dorsal cortex. (A) Dot-plot 
showing expression of canonical mammalian layer markers (columns) in turtle 
anterior dorsal cortex (aDC) clusters (rows). Dot size indicates the percentage 
of cells in the cluster where the gene is detected; color indicates expression 
(exp) level. Names of transcription factors are in red. (B) ISHs showing 
expression of mammalian UL and DL transcription factors throughout the turtle 
aDC. Arrowheads indicate the medial and lateral boundaries of anterior dorsal 
cortex. Scale bar, 500 μm. (C) Pairwise correlations of turtle aDC (columns) and
mouse neocortex (rows) glutamatergic types. Dots indicate statistically
significant correlations. Mouse clusters are from (7). (D) Left panel, turtle DL-like
cells, labeled by PTHLH (green), are preferentially found in superficial L2 [blue,
4′,6-diamidino-2-phenylindole (DAPI)]. Middle panel, double ISH for SATB1
(enriched in UL-like cells, magenta) and ETV1 (e07 DL-like cells, green), showing 
the relative positions of these cells. Right panel, schematic summarizing the 
distribution of UL- and DL-like glutamatergic neurons (inverse arrangement to 
that observed in mammals). See also fig. S15. Scale bar, 100 μm.


clusters corresponding to mammalian Martinotti 
or basket cells. This suggests that species-specific 
subtypes of interneurons diversified independently 
in mammals and reptiles from ancestral PV-like, 
SST, and HTR3A (possibly VIP-like and Reln) cell 
classes. 


Discussion 


Our single-cell data provide molecular support 
to the existence of conserved regions and cell 
types in the amniote pallium. Homologs of the 
mammalian neocortex, “core” hippocampus (with 
dentate and CA fields), claustrum, and pallial 
amygdala probably existed already in stem am- 
niotes. Glutamatergic neurons with an UL- or 
DL-like genetic signature exist in turtle anterior 
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dorsal cortex, but one-to-one homologies of these 
cell types and individual mammalian layer types 
are not supported by our data. By contrast, cor- 
tical GABAergic interneuron classes (SST, PV- 
like, and HTR3A) are ancestral in amniotes. 
Our analysis indicates that mammalian and 
reptilian brains diversified by expansion and in- 
dependent evolution of different pallial territo- 
ries and that these expansions coincided with 
the evolution of new neuronal types. The large 
reptilian anterior DVR (ventral pallium), for 
example, consists of spatially segregated neuronal
types specialized in processing visual, auditory,
or somatosensory stimuli (17). The same
sensory pathways also reach the ventral pallium 
of mammals (for example, the lateral amygdala) 


25 May 2018 


but are not processed by dedicated neuronal pop- 
ulations there (12). This suggests that the elab- 
oration of DVR neurons and circuits is specific 
to reptiles and birds. 

The situation is reversed with the dorsal pal- 
lium. Mammalian neocortex emerged by expan- 
sion of a small dorsal pallial territory, homologous 
to the anterior dorsal cortex of reptiles and to 
the avian Wulst (1). This may have coincided
with the generation of new glutamatergic cell 
types from the diversification of UL- and DL-like 
neurons. The sequential generation of neurons 
with distinct identities is a conserved feature of 
mammalian and reptilian (33) dorsal pallium 
progenitors, and corticothalamic neurons are 
early born neurons in mouse and turtle (28). 
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Neuronal birth order is the same in turtle and 
mammals (DL first, UL late) (28), and the super- 
ficial position of turtle DL-like neurons is consist- 
ent with the inversion of corticogenesis (outside-in 
in reptiles, inside-out in mammals) (28, 34). Finally,
in mammals, transcription factors specifying
UL and DL fate are coexpressed in progenitors
and acquire mutually exclusive expression only 
after cell-cycle exit (35). The coexpression of 
mammalian UL and DL fate specifiers in turtle 
neurons suggests that cortical layers may have 
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evolved from the remodeling of regulatory inter- 
actions between these transcription factors, possi- 
bly through new repressive interactions. The 
temporal extension of neurogenesis (33) could 
have enabled the segregation of originally over- 
lapping gene expression programs and thus the 
diversification of ancestral UL and DL types. 
These molecular maps of turtle and lizard pal- 
lium provide a framework for future functional 
studies. How do similarities and differences in 
gene expression programs inform us about the 
Fig. 5. Turtle GABAergic interneurons.

(A) On left, heatmap showing expression of 
selected mammalian GABAergic markers 
(rows) in turtle single cells, grouped by 
cluster (column), and, on right, expression 
of the same markers in mouse cortical 
GABAergic cells [data from (7)]. Names of 
transcription factors are in red. OB, olfactory 
bulb. (B) Pairwise correlations of turtle 
(columns) and mouse (rows) MGE- and 
CGE-derived GABAergic clusters. Dots 
indicate statistically significant correlations. 
(C) GABAergic neurons, labeled by ELAVL2, in 
turtle. (D) Double labeling of SST (immuno- 
histochemistry, magenta) and ADARB2, 
PLAU, or LHX6 (ISH, green) in turtle dorsal 
cortex. LHX6 labels both SST+ (SST) and
SST− (PV-like) neurons. White arrowheads
indicate SST neurons (SST+ LHX6+); blue
arrowheads indicate PV-like neurons
(PLAU+ LHX6+ SST−); green arrowheads
indicate HTR3A neurons (ADARB2+).
Scale bars, 20 μm. (E) ISHs of GABAergic
markers in turtle dorsal cortex. 


evolution of brain function? Much of reptilian 
cortex, for example, is molecularly related to core 
mammalian hippocampus. Lesion experiments 
in reptiles suggest a role for medial cortex in navi- 
gation (22), but no chronic electrophysiological 
data have, as of yet, been obtained from these 
regions. Integrating our molecular maps with 
tracing, electron microscopy, and electrophysio- 
logical studies is needed to understand whether 
and how gene expression programs are linked 
to the evolution of connectivity and function. 

Common principles of synaptic organization 
can be extrapolated from the comparison of three- 
and six-layered cortices (36). Are these similar- 
ities coincidental or do they reflect ancestral 
cortical circuits? In mammals, local circuit motifs 
involve GABAergic types with distinct molecular 
and functional features (30). Our study indicates 
that interneuron diversity is ancestral to both 
mammals and reptiles. The conservation of in- 
terneuron classes over at least 320 million years 
suggests not only that there is selective pressure 
for interneuron diversity but also that interneu- 
ron diversity itself may have evolved in stem 
vertebrates within primordial cortical circuits 
(37). Alternatively, the conservation of interneu- 
ron classes may result simply from developmental 
constraints on subpallium patterning. The com- 
bined molecular and functional investigation of 
circuit motifs in reptilian cortex may shed light 
on the ancestral design and computational logic 
of vertebrate cortices. 
Cobalt-catalyzed asymmetric
hydrogenation of enamides enabled
by single-electron reduction

Max R. Friedfeld, Hongyu Zhong, Rebecca T. Ruck,
Michael Shevlin,* Paul J. Chirik*


Identifying catalyst activation modes that exploit one-electron chemistry and overcome 
associated deactivation pathways will be transformative for developing first-row transition 
metal catalysts with performance equal or, ideally, superior to precious metals. Here we 
describe a zinc-activation method compatible with high-throughput reaction discovery that 
identified scores of cobalt-phosphine combinations for the asymmetric hydrogenation of 
functionalized alkenes. An optimized catalyst prepared from (R,R)-Ph-BPE {Ph-BPE, 1,2- 
bis[(2R,5R)-2,5-diphenylphospholano]ethane} and cobalt chloride (CoCl2·6H2O) exhibited
high activity and enantioselectivity in protic media and enabled the asymmetric synthesis
of the epilepsy medication levetiracetam at 200-gram scale with 0.08 mole % catalyst
loading. Stoichiometric studies established that the cobalt(II) catalyst precursor
(R,R)-Ph-BPECoCl2 underwent ligand displacement by methanol, and zinc promoted facile
one-electron reduction to cobalt(I), which more stably bound the phosphine.


Asymmetric catalysis with soluble metal
complexes has transformed the preparation
of single enantiomers in the pharmaceuti-
cal, fragrance, and fine-chemical industries
(1, 2). Because different antipodes of chiral
molecules often exhibit distinct biological prop-
erties, the U.S. Food and Drug Administration
has strict requirements for single-enantiomer
drugs, and the importance of asymmetric trans-
formations in the pharmaceutical industry will
continue to grow. Beginning with Knowles’s syn-
thesis of the Parkinson’s medication L-dopa by
rhodium-catalyzed asymmetric alkene hydro- 
genation (3), catalysis by homogeneous catalysts 
containing precious metals with tunable lig- 
ands has revolutionized the approach to single- 
enantiomer active pharmaceutical ingredients 
(APIs). 

The widespread application of asymmetric 
hydrogenation, particularly in the pharmaceutical 
industry, has motivated efforts to identify cata- 
lysts based on earth-abundant first-row transi- 
tion metals rather than traditionally used precious 
metals (4). In alkene hydrogenation catalysis, 
rhodium and iridium catalysts operate by pre- 
dictable, two-electron cycles involving oxidative 
addition and reductive elimination [for example, 
M(I)-M(III)]. However, compared to their heavier
congeners, first-row transition metals have kinet-
ically and thermodynamically accessible oxidation
states separated by one electron, often to the
detriment of catalytic chemistry (5). Although 
considerable progress has been made (6, 7), state- 
of-the-art catalysts with iron, cobalt, and nickel 
lack many of the favorable properties associated 
with precious metal catalysts that facilitate scale 
up. Alkene hydrogenation catalysts with earth- 
abundant metals are typically air- and moisture- 
sensitive, requiring rigorously dried solvents; are 
intolerant of many polar functional groups found 
in APIs; and have insufficient activity to be ap- 
plied industrially. 

Asymmetric hydrogenation and other enantio- 
selective metal-catalyzed reactions often rely on 
the successful relay of stereochemical information 
from a chiral ligand to the substrate (8). There- 
fore, understanding and controlling ligand co- 
ordination and dissociation equilibria are key 
to enabling catalyst stability and communicating 
stereochemical information. Unlike other tactics 
for improving catalyst performance, rational con-
trol of catalyst activation and deactivation is
challenging and is often circumvented by increased
catalyst loadings (9-11). With classic transition
metal catalysts such as Wilkinson’s (Ph3P)3RhCl
(12) and (Ph3P)3RuCl2 (13) (Ph, phenyl), catalyst
activation and, ultimately, performance are limited
by phosphine dissociation equilibria and halide 
coordination (Fig. 1, A and C). These limitations are 
overcome with weakly coordinating anions and 
hydrogenation of ancillary diene or triene ligands 
to create open coordination sites (Fig. 1, B and 
D), as exemplified by the Schrock-Osborn-type 
catalysts [(P,P)Rh(diene)][X] (14), where X is
any noncoordinating anion, and the cationic
ruthenium catalyst [(P-P)Ru(H)(triene)][BF4] (15).
These design principles have proven useful for
fast, reliable catalyst activation and are widely 
used for new catalyst discovery. 

Our laboratory has reported that two-carbon-
bridged, C2-symmetric (bis)phosphines support
highly active and enantioselective cobalt cata-
lysts for the asymmetric hydrogenation of simple
dehydro-α-amino acid derivatives (6). Although
these catalysts are state-of-the-art among first-
row metals and provide an important demon-
stration of the promise of earth-abundant metals
in asymmetric alkene hydrogenation, major lim-
itations include the use of pyrophoric activators
such as LiCH2SiMe3, extreme air sensitivity of
the catalyst, and lack of reactivity among many
classes of phosphines, likely owing to catalyst
deactivation by irreversible loss of ligand (Fig. 1E).
Isolated organometallic compounds such as (R,R)-
QuinoxP*Co(CH2SiMe3)2 and (R,R)-iPr-DuPhosCo
(CH2SiMe3)2 (iPr, isopropyl), albeit more active,
require multistep organometallic syntheses and 
special handling that are likely impractical for 
industrial application. Here we describe advances 
in cobalt-catalyzed asymmetric alkene hydrogena- 
tion, where mechanistic insights into ligand dis- 
sociation equilibria and the unique properties of 
the first-row transition metals are leveraged to 
address fundamental limitations of existing ca- 
talysts. Two sequential single-electron reductions 
of substitutionally labile Co(II) complexes result 
in formation of more robust catalysts in situ. This 
advance in catalyst activation enabled the dis- 
covery of scores of effective metal-ligand combina- 
tions for asymmetric hydrogenation, culminating 
in the use of low catalyst loadings for the prac- 
tical, pilot-scale synthesis of an API. 

The hydrogenation of dehydro-levetiracetam 
(1) was selected for initial catalyst development 
studies to highlight the challenges associated 
with API synthesis. The corresponding chiral 
product, levetiracetam (Keppra), is a widely used
medication for epilepsy (16). In one patented
route (17), levetiracetam was prepared by as-
ymmetric hydrogenation under optimized
conditions with 0.5 mole % (mol %) of in situ-
generated [(S,S)-Et-DuPhosRh(COD)][OTf] in di-
chloromethane (Et, ethyl; COD, 1,5-cyclooctadiene;
OTf, triflate). The relatively high catalyst loading
and use of a noncoordinating, chlorinated sol-
vent reflect the challenges associated with hydro-
genation of 1 as a poorly coordinating substrate
(17) with limited conformational flexibility for 
achieving two-point binding (18). 

Initial studies on hydrogenation of 1 relied on 
high-throughput experimentation to evaluate re- 
action variables, including solvents, cobalt sources, 
activators, temperature, and catalyst loadings 
(tables S1 to S6). A remarkable solvent dependence 
was identified: Protic solvents such as methanol 
(MeOH), ethanol, and trifluoroethanol provided 
the highest yields and enantiomeric excesses 
(tables S1 to S4). These unexpected results in- 
dicated that cobalt hydrides competent for en- 
antioselective alkene hydrogenation could, despite 
their anticipated hydricity, be formed in protic 
solvents. The use of alcohol solvents has been 
prevalent since the discovery of enantioselective 
Fig. 1. Catalyst-activation strategies for alkene hydrogenation
catalysts. (A) Activation of Wilkinson's catalyst (Ph3P)3RhCl is limited by
unfavorable PPh3 dissociation equilibrium and strongly coordinating Cl−.
(B) Schrock-Osborn-type catalysts are paired with a weakly coordinating
anion such as PF6− and rely on irreversible hydrogenation of diene
ligands to open coordination sites. (C) Wilkinson's ruthenium catalyst is


homogeneous hydrogenation with transition 
metal catalysts (19). Catalysts containing Rh(I),
Ir(I), and Ru(II) are usually highly active in
MeOH, in part, because the solvent can act as a
stabilizing ligand occupying open coordination
sites on the metal but be readily displaceable
by alkene substrates during catalysis (Fig. 1, A,
B, and D). Also highly preferred in process scale
reactions, MeOH is a green solvent (20) with
excellent solubility for both H2 and drug inter-
mediates with polar functional groups. 

The proposed mechanism of cobalt-catalyzed 
hydrogenation of dehydro-α-amino acid deriva-
tives involves formation of a Co(II) dihydride as
the active species that promotes alkene insertion
(Fig. 1E) (21). We wondered whether this po-
tential intermediate could be accessed instead
from H2 oxidative addition to Co(0), a pathway
reminiscent of Rh(I) and Ir(I) catalysis. This 
strategy, in turn, would require a convenient 
means to reduce cobalt precursors from higher 
oxidation states. 
ligand; cat., catalyst.


Mild reducing agents compatible with MeOH, 
such as readily available and easily handled Zn, 
Mn, Mg, and Fe powders, were examined for in 
situ activation of Co(II) phosphine complexes 
for hydrogenation of 1. Both high yields and en- 
antiomeric excesses (ee’s) of levetiracetam were 
obtained after optimization with most in situ 
metallic reductants (table S7), suggesting the 
generation of reduced Co species in MeOH, which 
activated H2 and were effective for the enantio-
selective reduction of 1. Filtration of the catalyst
suspension to remove excess reductant and the 
accompanying metal salt had minimal effect on 
catalyst performance, which simplified the reac- 
tion setup. Among the metal powders examined, 
Zn dust provided the best results, with respect to 
both product yields and enantioselectivity. 
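
The enantiomeric excess (ee) values quoted throughout can be related to the underlying enantiomer composition with a short calculation. The sketch below uses the standard definition, ee = (major − minor)/(major + minor); the function names are illustrative and not taken from the paper, and the inputs are assumed to be amounts of each enantiomer (for example, integrated chiral-chromatography peak areas).

```python
def enantiomeric_excess(major: float, minor: float) -> float:
    """Return ee in percent from the amounts of the major and minor enantiomers."""
    total = major + minor
    if total <= 0:
        raise ValueError("total amount of enantiomers must be positive")
    return 100.0 * (major - minor) / total


def enantiomer_composition(ee_percent: float) -> tuple:
    """Return (major %, minor %) of the mixture for a given ee in percent."""
    major = (100.0 + ee_percent) / 2.0
    return major, 100.0 - major


if __name__ == "__main__":
    # Hypothetical illustration: composition implied by an ee of 98.2%.
    major, minor = enantiomer_composition(98.2)
    print(f"major enantiomer: {major:.1f}%, minor enantiomer: {minor:.1f}%")
```

Read this way, an ee in the high 90s means the minor enantiomer makes up only about 1% of the product.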

Given the sensitivity of ligand-substrate inter- 
actions to subtle structural manipulations, the 
broad application of earth-abundant transition 
metal catalysts for asymmetric alkene hydrogena- 
tion will depend on the realization of a practical,
activated by base in H2 to form ruthenium monohydride but suffers

from limited coordination sites. (D) Cationic ruthenium catalysts are 
designed to open coordination sites. (E) Previous work on cobalt catalysts 
relied on activation by alkyl lithium reagents and posited formation of 
cobalt dihydride as an active catalyst. S, solvent molecule; L, neutral 


rapid, and reproducible method for evaluating 
various metal-ligand combinations for the ever- 
changing structures of alkenes found in the drug 
pipeline. Accordingly, the optimized reaction con-
ditions, CoCl2 with Zn dust as an activator in
MeOH, were applied to a high-throughput evalua-
tion of an expanded library of 216 chiral bidentate
ligands. Zinc activation enabled high activity
and enantioselectivity across a broad range of li-
gands, including bis(phosphine), secondary phos-
phine oxide-phosphine, phosphine-oxazoline, and
phosphine-thioether with varied chelate sizes
(Fig. 2C and fig. S13). The number of ligands ef-
fective under Zn activation was remarkably higher
than the corresponding compatibilities with the 
previously reported (6) organolithium-activation 
method (Fig. 2A; for Zn activation, only the 192- 
ligand library results are plotted). The high 
efficacy of the Zn-activation method was not 
limited to the asymmetric hydrogenation of 
1. Scores of cobalt-phosphine combinations 
were identified for the successful asymmetric 
Fig. 2. High-throughput evaluation of chiral phosphine ligands. (A)
Comparing results of 192 chiral ligands for the asymmetric hydrogenation 
of dehydro-levetiracetam by using LiCH2SiMe3 (top, table S9) and Zn 
(bottom, table S8). (B) Ligand compatibility with LiCH2SiMe3 (top; see 
also table S19) and Zn (bottom; see also table S20) for hydrogenation of a 


hydrogenation of N-(1-phenylvinyl)acetamide, 
methyl 2-acetamidoacrylate, and methyl (Z)- 
2-acetamido-3-phenylacrylate (figs. S16 to S18 
and tables S21 to S24), highlighting the improved
versatility over the previously reported
LiCH2SiMe3-activation method (6). The hydrogenation
of methyl 2-acetamidoacrylate was particularly 
notable (Fig. 2B), as essentially all phosphines in 
the library produced quantitative conversion to 
product. The highly successful Zn-MeOH reduc- 
tion is likely attributable to enhanced activation 
and suppressed deactivation of catalyst: Low 
oxidation-state Co species with open coordina- 
tion sites are generated owing to the high Cl−
binding affinity of Zn2+, and formation of metallic
Co(0) is likely suppressed owing to favorable 
ligand coordination to reduced Co species. The 
robustness of the Zn method was also demon- 
strated by the successful hydrogenation of 1 in 
the presence of 5 volume % of water in MeOH. 
Among the many successful ligands identified, 
(R,R)-Ph-BPE {Ph-BPE, 1,2-bis[(2R,5R)-2,5-diphe-
nylphospholano]ethane} and (R,R)-iPr-DuPhos
were selected for subsequent studies because of 
their outstanding performance at low (0.20 mol %) 
catalyst loadings (Fig. 2C and table S17). Although 
Zn activation of bis(phosphine)-cobalt catalysts 
has been reported in C-C bond-forming reac- 
tions (22, 23), the protocol was not applied to
hydrogenation in protic solvents.

Stoichiometric experiments were conducted
to determine which oxidation states of Co were
formed under catalytic conditions and to un-
derstand the origin of the improved catalyst
fidelity. Compound (R,R)-Ph-BPECoCl2 (2a) was
isolated after addition of the phosphine to a
tetrahydrofuran (THF) solution of CoCl2·6H2O
in 96% yield and crystallographically charac-
terized (fig. S50). A high spin, S = 3/2 ground state
for 2a was determined by electron paramagnetic
resonance (EPR) spectroscopy (fig. S9) and solu-
tion magnetometry. Surprisingly, dissolution of
the dark purple 2a in MeOH resulted in com-
plete dissociation of the (bis)phosphine from
the substitutionally labile Co(II) center to form
a proposed solvento complex [Co(MeOH)6][Cl]2
along with precipitation of free ligand, as judged
by ultraviolet-visible spectroscopy and 31P nu-
clear magnetic resonance spectroscopy (figs.
S40 to S44). Phosphine dissociation proved re-
versible, as removing the MeOH solvent and
dissolving the residue in THF reconstituted 2a.
The cobalt dichloride (R,R)-iPr-DuPhosCoCl2 (2b)
underwent partial ligand dissociation in MeOH
(fig. S45). These findings suggest that unfavor-
able ligand dissociation from Co(II) by MeOH
displacement could be a catalyst-deactivation
pathway.

representative alkene methyl-2-acetamidoacrylate. (C) Ligands in the
expanded 216-ligand library that give highest enantioselectivity for dehydro-
levetiracetam hydrogenation. Catalyst loadings were 10 mol % unless
otherwise noted. Positive and negative ee values correspond to (S) and (R)
enantiomers, respectively. Me, methyl; tBu, tert-butyl.

Further studies revealed how the Zn-MeOH 
method overcomes the unfavorable ligand loss 
for productive catalysis. Under conditions resem-
bling those of the catalytic reaction, an equi-
molar mixture of (R,R)-Ph-BPE and CoCl2·6H2O
was stirred with excess Zn in 1:1 THF:MeOH.
A chloride-bridged Co(I) dimer [(R,R)-Ph-BPECo
(μ-Cl)]2 (3a) was isolated and crystallographically
characterized (fig. S51) as the Zn-reduction prod-
uct. The analogous iPr-DuPhos complex, 3b, was
prepared and characterized (fig. S53) by using the 
same method. Both Co(I) complexes 3a and 3b 
exhibit S = 1 ground states, as judged by their 
paramagnetic, EPR-silent nature and solution 
magnetic moments. Phosphine displacement 
by MeOH was not observed with 3a or 3b, in-
dicating that reduction of Co(II) to Co(I) reduces
phosphine lability and enhances inertness to
substitution by MeOH. Consistent with these
observations, Zn reduction of [Co(MeOH)6]
[Cl]2 also yielded 3a after addition of an equi-
molar amount of (R,R)-Ph-BPE. MeOH also fac-
ilitated reduction of 2a by Zn, as the reaction
was complete in 15 min at 23°C; an identical 
procedure in THF required 12 hours. The Zn and 


MeOH work synergistically to impart catalyst 
Fig. 4. Applications at large scale. (A) Previous work on cobalt-catalyzed asymmetric hydrogenation of dehydro-α-amino acid derivatives. (B) Patented
route for asymmetric hydrogenation for levetiracetam synthesis by a rhodium catalyst in dichloromethane solvent. (C) Industrially relevant cobalt- 
catalyzed asymmetric hydrogenation for levetiracetam synthesis in MeOH solvent. 


fidelity. Phosphine dissociation, a deleterious 
catalyst-deactivation pathway, is corrected by 
the fast Zn reduction in MeOH to regenerate 
and preserve the chiral environment on the metal. 
Abstraction of chloride by Zn also opens coordi-
nation sites on the Co for H2 activation and alkene
coordination. 

Cobalt(0) complexes were also prepared by
two sequential one-electron reductions upon pro-
longed treatment with Zn-MeOH in the presence
of ancillary ligands (fig. S1). The diene and arene
Co(0) derivatives, (R,R)-Ph-BPECo(COD) (4a) and
(R,R)-Ph-BPECo(η6-C6H6) (5), were synthesized
by Zn reduction of either Co(II) complex 2a or
Co(I) complex 3a. Complexes 4a and 5 were
characterized by EPR (figs. S10 and S11), establish- 


Friedfeld et al., Science 360, 888-893 (2018) 


ing an S = 1/2 ground state, and 5 was also char- 
acterized by x-ray diffraction (fig. S52). Compound 
(R,R)-iPr-DuPhosCo(COD) (4b) was also prepared
from Zn reduction of either 2b or 3b and char-
acterized by EPR and x-ray diffraction (figs. S12
and S54). These observations suggest that both 
Co(I) and Co(0) are accessible via one-electron 
reductions using the Zn-MeOH protocol. 
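
The ground states assigned to the Co(II), Co(I), and Co(0) complexes (S = 3/2, 1, and 1/2) can be translated into the number of unpaired electrons and an idealized magnetic moment. The sketch below uses the textbook spin-only formula, mu = g·sqrt(S(S+1)) in Bohr magnetons with g = 2; this is an assumption made here for illustration, not the analysis reported in the paper, and measured solution moments for cobalt complexes often deviate from it because of orbital contributions.

```python
import math


def unpaired_electrons(total_spin: float) -> int:
    """Number of unpaired electrons for a total spin S (n = 2S)."""
    return int(round(2 * total_spin))


def spin_only_moment(total_spin: float, g: float = 2.0) -> float:
    """Spin-only magnetic moment in Bohr magnetons: g * sqrt(S(S+1)).

    Assumes no orbital contribution (an idealization).
    """
    return g * math.sqrt(total_spin * (total_spin + 1))


if __name__ == "__main__":
    # Oxidation-state labels follow the text; the moments are idealized estimates.
    for label, spin in (("Co(II), S = 3/2", 1.5), ("Co(I), S = 1", 1.0), ("Co(0), S = 1/2", 0.5)):
        print(f"{label}: {unpaired_electrons(spin)} unpaired e-, "
              f"spin-only moment ~ {spin_only_moment(spin):.2f} Bohr magnetons")
```

Each one-electron reduction removes one unpaired electron in this series, which is why the spin state is a convenient fingerprint of the cobalt oxidation state.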
Alternative routes other than Zn reduction 
to complexes 3a, 3b, 4a, 4b, and 5 were also 
developed (fig. S1). The Co(II) dialkyl complex
(R,R)-Ph-BPECo(CH2SiMe3)2 (6) was synthesized
by reported procedures (6), and hydrogenation
of an equimolar mixture of 2a and 6 yielded
the monochloride dimer 3a along with SiMe4.
Hydrogenation of 6 in the presence of diene or
arene produced the Co(0) complexes 4a and 5.
The corresponding Co(I) and Co(0) complexes
with (R,R)-iPr-DuPhos were also prepared from
the cobalt dialkyl.

To demonstrate the catalytic performance of 
the isolated Co(I) and Co(0) sources, reactions 
with preformed 3a or 4a were conducted. Hy- 
drogenation of 1 in the presence of 0.5 mol % of 
Zn-free 3a in MeOH with 500 psi (34 atm) of 
H2 at 50°C produced levetiracetam in 99.9%
yield and 98.2% ee (Fig. 3B), demonstrating that 
single-component, Zn-free Co(I) complexes are 
also effective for synthesis of the API. EPR ex- 
periments (figs. S47 to S49) support a pathway 
whereby the substrate 1 induces disproportion- 
ation of the Co(I) monochloride 3a into Co(II) 
dichloride 2a and a Co(0) alkene complex, with
the latter undergoing reaction with H2 to enter
the catalytic cycle. A notable pressure dependence
was observed with 3a, as hydrogenation of 1 at
55 psi (3.7 atm) of H2 resulted in only 38.5% yield
after 72 hours, highlighting that H2 is key to favoring
the equilibria that generate and trap a Co(II) dihydride
from 3a (21, 24). Consistent with this hypothesis,
the cobalt diene precursor 4a was effective at the
lower H2 pressure of 55 psi (3.7 atm) and pro-
duced levetiracetam in 99.1% yield and 97.5% ee 
(Fig. 3B). Hydrogenation of the diene ligand 
liberated cyclooctane and likely generated the 
Co(II) dihydride, which, in the absence of chlo- 
ride, is free from inhibitive binding equilibria. 
Similar results were obtained with 5. Deuterium 
labeling experiments were conducted with the Zn- 
activated catalyst 3a and D2 gas in natural abun-
dance MeOH and furnished 1,2-d2-levetiracetam
(d, deuterium), supporting homolytic H2 cleavage
and formation of cobalt dihydride, distinct from
previous systems where heterolytic H2 cleavage
by metal carboxylates and protonation by solvent
are the proposed mechanism (7, 25).
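
The pressure pairs quoted above (500 psi and 34 atm; 55 psi and 3.7 atm) follow from a single conversion factor. The sketch below is a minimal check, assuming the standard value of about 14.6959 psi per atmosphere and treating the quoted psi values as plain pressures; the text does not say whether they are gauge or absolute readings.

```python
PSI_PER_ATM = 14.6959  # one standard atmosphere expressed in pounds per square inch


def psi_to_atm(pressure_psi: float) -> float:
    """Convert a pressure from psi to standard atmospheres."""
    return pressure_psi / PSI_PER_ATM


if __name__ == "__main__":
    # The two H2 pressures mentioned in the text.
    for pressure_psi in (500.0, 55.0):
        print(f"{pressure_psi:.0f} psi = {psi_to_atm(pressure_psi):.1f} atm")
```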

To examine practicality in an industrial set- 
ting, the Co catalysts were applied to larger scale 
hydrogenations. Zinc activation proved optimal 
and most convenient and was therefore selected 
for these experiments. Quantities ranging from 
1.3 to 20 g of 1 were successfully hydrogenated 
with 0.06 mol % each of (R,R)-Ph-BPE and
CoCl2·6H2O in MeOH and 0.6 mol % Zn, pro-
ducing levetiracetam in up to 100% yield and
97.5% ee in solution. Monitoring H2 uptake es-
tablished a highly reactive catalyst in the first
few minutes after H2 was introduced, and the
reaction was completed in 4 hours (fig. S15);
special care must be taken with higher loadings
to avoid heat generation from the exothermic
reaction. Immediate introduction of H2 after cat-
alyst injection results in the best catalyst per-
formance. By using these optimized conditions, a 
200-g-scale hydrogenation was carried out with 
a slightly higher loading of 0.08 mol %, and 
levetiracetam was obtained in 97% isolated yield 
and 98.2% ee (Fig. 4C), demonstrating that earth-
abundant metals are competitive with, if not superior
to, precious metals for the synthesis of high-value
single-enantiomer APIs at an industrial scale in
an environmentally benign solvent. These findings
highlight the benefits of first-row transition metals 
in catalysis where oxidation states separated 
by one electron offer distinct strategies for im- 
proving catalyst performance and lifetime. 
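
To put the 200-g run in perspective, the catalyst loading can be converted into an absolute amount of cobalt and an upper-bound turnover number. The sketch below is a back-of-the-envelope estimate, not a calculation from the paper: it assumes dehydro-levetiracetam has the formula C8H12N2O2 (levetiracetam, C8H14N2O2, minus H2), takes the isolated yield as a proxy for conversion, and assumes every cobalt center is active.

```python
# Standard atomic masses in g/mol (rounded).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

# Assumed formula of dehydro-levetiracetam (not stated in the text).
DEHYDRO_LEVETIRACETAM = {"C": 8, "H": 12, "N": 2, "O": 2}


def molar_mass(formula: dict) -> float:
    """Molar mass in g/mol from an element -> count mapping."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def catalyst_mmol(substrate_g: float, substrate_molar_mass: float, loading_mol_percent: float) -> float:
    """Millimoles of catalyst implied by a loading given in mol % of substrate."""
    substrate_mol = substrate_g / substrate_molar_mass
    return substrate_mol * loading_mol_percent / 100.0 * 1000.0


def turnover_number(yield_percent: float, loading_mol_percent: float) -> float:
    """Upper-bound turnovers per metal center: moles of product per mole of catalyst."""
    return yield_percent / loading_mol_percent


if __name__ == "__main__":
    mass = molar_mass(DEHYDRO_LEVETIRACETAM)
    print(f"substrate molar mass ~ {mass:.1f} g/mol")
    print(f"catalyst for 200 g at 0.08 mol %: ~ {catalyst_mmol(200.0, mass, 0.08):.2f} mmol")
    print(f"turnover number at 97% yield: ~ {turnover_number(97.0, 0.08):.0f}")
```

Under these assumptions the 200-g batch needs roughly one millimole of cobalt, which corresponds to more than a thousand turnovers per metal center.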


REFERENCES AND NOTES 


1. C. S. Shultz, S. W. Krska, Acc. Chem. Res. 40, 1320-1326 (2007).
2. L. A. Saudan, Acc. Chem. Res. 40, 1309-1319 (2007).
3. W. S. Knowles, Angew. Chem. Int. Ed. 41, 1998-2007 (2002).
4. P. J. Chirik, Acc. Chem. Res. 48, 1687-1695 (2015).
5. S. Ge, R. A. Green, J. F. Hartwig, J. Am. Chem. Soc. 136, 1617-1627 (2014).
6. M. R. Friedfeld et al., Science 342, 1076-1080 (2013).
7. M. Shevlin et al., J. Am. Chem. Soc. 138, 3562-3569 (2016).
8. T. P. Yoon, E. N. Jacobsen, Science 299, 1691-1693 (2003).
9. R. H. Crabtree, Chem. Rev. 115, 127-150 (2015).
10. D. Heller, A. H. M. de Vries, in The Handbook of Homogeneous Hydrogenation, J. G. de Vries, C. J. Elsevier, Eds. (Wiley-VCH, 2007), pp. 1483-1516.
11. J. M. Hoyt et al., Organometallics 33, 5781-5790 (2014).
12. J. A. Osborn, F. H. Jardine, J. F. Young, G. Wilkinson, J. Chem. Soc. A 0, 1711-1732 (1966).
13. P. S. Hallman, D. Evans, J. A. Osborn, G. Wilkinson, Chem. Commun. (Camb.) 0, 305-306 (1967).
14. R. R. Schrock, J. A. Osborn, J. Am. Chem. Soc. 98, 2134-2143 (1976).
15. D. A. Dobbs et al., Angew. Chem. Int. Ed. 39, 1992-1995 (2000).
16. B. Abou-Khalil, Neuropsychiatr. Dis. Treat. 4, 507-523 (2008).
17. J. Surtees, V. Marmon, E. Differding, V. Zimmerman, PCT international application patent WO 2001064637 A1 (2001).
18. C. R. Landis, J. Halpern, J. Am. Chem. Soc. 109, 1746-1754 (1987).
19. B. D. Vineyard, W. S. Knowles, M. J. Sabacky, G. L. Bachman, J. Weinkauff, J. Am. Chem. Soc. 99, 5946-5952 (1977).
20. D. Prat et al., Org. Process Res. Dev. 17, 1517-1525 (2013).
21. X. Ma, M. Lei, J. Org. Chem. 82, 2703-2712 (2017).
22. G. Hilt, Eur. J. Org. Chem. 2012, 4441-4451 (2012).
23. D. K. Kim, J. Riedel, R. S. Kim, V. M. Dong, J. Am. Chem. Soc. 139, 10208-10211 (2017).
24. M. R. Friedfeld, G. W. Margulieux, B. A. Schaefer, P. J. Chirik, J. Am. Chem. Soc. 136, 13178-13181 (2014).
25. T. J. Korstanje, J. I. van der Vlugt, C. J. Elsevier, B. de Bruin, Science 350, 298-302 (2015).


ACKNOWLEDGMENTS 


We acknowledge J. Hill and M. Maust for assistance with high-pressure 
reactions conducted on large scale, A. Chin for determining solubility 
data for product isolation, and J. Gouker for assistance with chiral 
method development. Funding: Financial support was from a U.S. 
National Science Foundation (NSF) Grant Opportunities for Academic 
Liaison with Industry (GOALI) grant (CHE-1265988) between Princeton 
and Merck. M.R.F. acknowledges the NSF for a Graduate Research 
Fellowship (DGE-1148900). Author contributions: M.R.F., H.Z., and 
M.S. conducted all of the experimental work, interpreted results, 

and prepared the manuscript. R.T.R. and P.J.C. supervised the research 
and assisted with data analysis and manuscript preparation. 
Competing interests: None declared. Data and materials 
availability: Crystallographic data for compounds 2a, 3a, 3b, 4b, 
and 5 are available free of charge from the Cambridge Crystallographic 
Data Centre under reference numbers CCDC 1529292, 1529294, 
1586348, 1586349, and 1529293, respectively. All other 
characterization data are reported in the supplementary materials. 


SUPPLEMENTARY MATERIALS 
www.sciencemag.org/content/360/6391/888/suppl/DC1 
Materials and Methods 

Figs. S1 to S54

Tables S1 to S30 

References (26-29) 


28 November 2017; resubmitted 13 February 2018 
Accepted 3 April 2018 
10.1126/science.aar6117 




RESEARCH 


VALLEYTRONICS 


Imaging of pure spin-valley diffusion
current in WS2-WSe2 heterostructures


Chenhao Jin, Jonghwan Kim, M. Iqbal Bakti Utama, Emma C. Regan,
Hans Kleemann, Hui Cai, Yuxia Shen, Matthew James Shinner,
Arjun Sengupta, Kenji Watanabe, Takashi Taniguchi, Sefaattin Tongay,
Alex Zettl, Feng Wang


Transition metal dichalcogenide (TMDC) materials are promising for spintronic and 
valleytronic applications because valley-polarized excitations can be generated and 
manipulated with circularly polarized photons and the valley and spin degrees of freedom 
are locked by strong spin-orbital interactions. In this study we demonstrate efficient 
generation of a pure and locked spin-valley diffusion current in tungsten disulfide 
(WS2)-tungsten diselenide (WSe2) heterostructures without any driving electric field.
We imaged the propagation of valley current in real time and space by pump-probe
spectroscopy. The valley current in the heterostructures can live for more than 20 microseconds
and propagate over 20 micrometers; both the lifetime and the diffusion length can be
controlled through electrostatic gating. The high-efficiency and electric-field-free generation
of a locked spin-valley current in TMDC heterostructures holds promise for applications 


in spin and valley devices. 


Transition metal dichalcogenides (TMDCs)
offer a promising platform for applications
in spintronics and valleytronics because of
their distinctive electronic structure and
strong spin-orbital interactions (1-13). At
the K and K’ points of the Brillouin zone in 
TMDCs, two degenerate but inequivalent valleys 
are present, and these valleys can be used to en- 
code binary information. In addition, spin-orbital 
coupling leads to locked spin and valley degrees 
of freedom in TMDC monolayers. The locked 
spin-valley polarization of charge carriers can 
exhibit a very long lifetime because intervalley 
scattering requires a large momentum change 
and a flip of the spin simultaneously and is there- 
fore a rare event (14-18). 

An outstanding challenge in spintronics and 
valleytronics is to efficiently generate, transport, 
and detect pure spin-valley current, which will 
be crucial not only for understanding novel spin- 
valley physics but also for potential applications 
in charge-current-free devices with low power 
consumption (19, 20). In traditional spintronic 
systems, a transverse spin current is generated 
through the spin Hall effect in materials with a
large spin Hall angle, such as tantalum and tung-
sten (21-23). Similarly, a transverse valley current
has also been realized through the valley Hall
effect in TMDC materials, but with low efficiency
caused by a small valley Hall angle (13). In both
cases a strong driving electric field is necessary, 
resulting in a dominant and unavoidable longi- 
tudinal electric current. 

In this study we exploited TMDC heterostruc- 
tures for efficient optical generation of a pure 
valley diffusion current without an external elec- 
tric field, which is accompanied by a pure spin 
diffusion current because of the spin-valley lock- 
ing in TMDCs. 

Figure 1, A and B, shows an optical microscopy
image and a side-view illustration of a representative
heterostructure device, respectively.
A WSe2-WS2 heterostructure (black dashed box
in Fig. 1A) is encapsulated in two hexagonal
boron nitride (hBN) flakes, with the ~40-nm-thick
bottom hBN also serving as the gate dielectric.
A few-layer graphene (FLG) back gate is used to
tune the carrier concentration in the WSe2-WS2
heterostructure, and two FLG source and drain
contacts (yellow dashed boxes in Fig. 1A) are
symmetrically placed on two sides of the heterostructure.
All of the two-dimensional materials
were first mechanically exfoliated from bulk crystals
and then stacked together by a dry transfer
method with a polyethylene terephthalate stamp
(24). The whole stack was then transferred onto
a 90-nm SiO2-Si substrate.

We first characterized the gate-dependent optical
transitions in the heterostructure by reflection
spectroscopy. The results are summarized
in Fig. 1C for carrier concentrations ranging
from −5 × 10¹²/cm² (hole doping) to 5 × 10¹²/cm²
(electron doping). Two prominent resonance
features appear around 1.72 and 2.05 eV, corresponding
to the A exciton energies in the WSe2
and WS2 monolayers, respectively. However, the
WSe2 and WS2 resonances exhibit distinctively
different gate dependencies. Resonances from
WSe2 vary substantially on the hole-doping side,
showing both a decrease in the exciton response
and an emergence of the trion response, consistent
with the behavior of hole-doped monolayer
WSe2 (25, 26). Meanwhile, the WS2 resonance
shows only a slight red shift. The behavior on
the electron-doping side is the opposite: only a
slight red shift occurs in the WSe2 resonance,
but the WS2 exciton and trion transitions vary
markedly. This peculiar gate dependence can
be understood from the type II band alignment
(27-29) between WSe2 and WS2 (Fig. 1D): The
conduction and valence bands in WS2 are lower
than the corresponding bands in WSe2. Consequently,
electrostatically induced electrons and
holes will stay only in the WS2 and WSe2 layers,
respectively.

We then generated a spin-valley current in the
hole-doped heterostructures by local circularly
polarized laser excitation (Fig. 2, A to D). This
mechanism exploits the selective coupling of
valley excitons to photon helicity, the ultrafast
interlayer charge transfer process, and the ultralong
valley hole lifetime in WS2-WSe2 heterostructures.
First, left-handed circularly polarized
photons at 1.80 eV selectively excite K valley
excitons in WSe2. The excited electrons transfer
to the conduction band of WS2 within the first
~100 fs (28, 29), turning off the exciton valley
depolarization channel (Maialle-Silva-Sham mechanism)
(30) and creating excess valley-polarized
holes in the WSe2 K valley (Fig. 2A) (18). Afterward,
electrons in WS2 recombine with holes in
WSe2 within ~100 ns. As discussed below, the
recombinations with K valley and K′ valley holes
in WSe2 have almost equal probabilities (Fig. 2B),
which leads to an excess of K valley holes and
a deficiency of K′ valley holes in WSe2 with an
ultralong lifetime of many microseconds (Fig.
2C). The local imbalance of valley-polarized holes
excited by a focused laser light can drive a pure
spin-valley diffusion current (with zero net charge
current) through diffusion in the heterostructure
(Fig. 2D).
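
The bookkeeping behind this mechanism can be illustrated with a toy rate model. The sketch below is not the model of the authors (their treatment is in ref. 24); it simply assumes that photo-electrons in WS2 recombine with holes in each WSe2 valley at a rate proportional to that valley's hole population (a bimolecular rate constant `k`, hypothetical), that the pump adds holes only to the K valley, and that densities are expressed in arbitrary units. Under these assumptions the total excess hole population decays away, while the valley polarization survives when the background doping `p0` is large compared with the photo-generated density.

```python
def recombine(p0, n_photo, k=1.0, dt=1e-4, steps=20000):
    """Toy model: electrons recombine with K and K' holes in proportion
    to each valley's hole population (assumed bimolecular kinetics).

    p0       background (electrostatic) hole density, split equally
             between the K and K' valleys
    n_photo  photo-generated density: holes added to K only, plus an
             equal number of electrons in the other layer
    Returns (total excess hole density, valley polarization p_K - p_K').
    """
    p_k = 0.5 * p0 + n_photo   # K valley holes right after the pump
    p_kp = 0.5 * p0            # K' valley holes
    n_e = n_photo              # photo-electrons available to recombine
    for _ in range(steps):     # explicit Euler time stepping
        loss_k = k * n_e * p_k * dt
        loss_kp = k * n_e * p_kp * dt
        p_k -= loss_k
        p_kp -= loss_kp
        n_e -= loss_k + loss_kp
    return p_k + p_kp - p0, p_k - p_kp


# Hole-doped case: background density 10x the photo-generated density.
excess_doped, pol_doped = recombine(p0=10.0, n_photo=1.0)
# Charge-neutral case: every hole is a pump-generated excess hole.
excess_neutral, pol_neutral = recombine(p0=0.0, n_photo=1.0)
```

In this toy model the surviving fraction of the valley polarization once all electrons have recombined is p0/(p0 + n_photo), so it approaches 1 for strong hole doping. In the charge-neutral case the polarization simply tracks the remaining excess hole density and vanishes with it.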

We performed spatial-temporal imaging of
the spin-valley current in the WS2-WSe2 heterostructure
by tracking the valley-polarized holes
in the device with space- and time-resolved circular
dichroism spectroscopy. An elliptically
shaped pump beam at 1.80 eV efficiently generates
spin- and valley-polarized holes in the
heterostructure, and a second elliptically shaped
probe beam at 1.70 eV probes their evolution
in space and time by varying both the spatial
separation and the temporal delay between the
pump and probe pulses (24). Figure 2E displays
the profile of the elliptically shaped pump and
probe beams at the sample, each with a half
width of ~1.5 μm. The temporal delay Δt between
the pump and probe pulses is electronically
generated by a data acquisition card with
12.5-ns resolution, whereas the spatial separation
Δx between the pump and probe pulses
along the width direction is controlled with
~0.2-μm resolution by tuning the angle of a mirror
(24). The circular dichroic reflection contrast
of the probe pulse directly measures the valley-polarized
hole density in the heterostructure at
a specific probe position and time (18). All experiments
were carried out at 10 K.

Figure 2F shows the measured evolution of the
valley-polarized hole density in real space and real
time for the hole-doped WS2-WSe2 heterostructure
at an initial electrostatic hole doping p₀ = 1 × 10¹²/cm².
The horizontal and vertical axes represent
the temporal and spatial separation between
the pump and probe pulses, respectively;
the colors represent the amplitude of the circular
dichroism signal on a logarithmic scale. The signal
at the origin (corresponding to spatially and
temporally overlapped pump and probe pulses)
is normalized to 1. At zero time delay, the valley-polarized
hole population is localized at the origin
with a distribution matching the pump profile
(half width of ~1.5 μm), and the circular dichroism
signal is negligible at pump-probe separations
larger than 3 μm. After a finite delay time, valley-polarized
holes start to diffuse out of the excitation
region, generating a pure spin-valley diffusion
current. This leads to a strong decrease in signal
close to the origin but an increase in signal in the
region with large pump-probe spatial separation.
In particular, a finite signal emerges at Δx > 3 μm,
where no valley polarization is present from the
direct pump excitation. As illustrated in Fig. 2F,
the valley current can readily propagate to a distance
of more than 8 μm within a time period
of 800 ns.

Fig. 1. Gate-dependent optical transitions in the WSe2-WS2 heterostructure.
(A and B) Optical microscope image (A) and side-view
illustration (B) of the heterostructure device. Black and yellow dashed
boxes in (A) mark the heterostructure region and FLG source and
drain electrodes, respectively. V_G, gate voltage. (C) Doping-dependent
reflection contrast of the WSe2-WS2 heterostructure. The dashed
line indicates charge neutrality. The two prominent resonances around
1.72 and 2.05 eV correspond to the A exciton energies in the WSe2
and WS2 monolayers, respectively, which exhibit distinctively different
doping dependencies because of the type II band alignment of the
heterostructure. (D) Type II band alignment of the WSe2 and WS2
interface. The conduction band minimum and the valence band
maximum reside in WS2 and WSe2, respectively. Electrostatically
doped electrons stay only in WS2 for the n-doping case (left panel),
whereas holes stay only in WSe2 for the p-doping case (right panel).
E_F, Fermi energy.

Fig. 2. Pure spin-valley diffusion current in hole-doped WSe2-WS2
heterostructures. (A to D) Experimental scheme for valley current
generation in hole-doped WSe2-WS2 heterostructures. The dashed line
represents the Fermi level before optical excitation. (A) Excitons in the
K valley of WSe2 are selectively excited by circularly polarized pump
light and efficiently converted into excess holes within ~100 fs through
the ultrafast interlayer charge transfer process. (B) Electrons in WS2
recombine with K valley and K′ valley holes in WSe2 with almost equal
probabilities, resulting in an excess of K valley holes and a deficiency
of K′ valley holes in WSe2. (C) This valley-polarized hole population
persists for tens of microseconds. (D) Local excitation of such valley-polarized
holes can drive a pure valley diffusion current (with zero
net charge current) through diffusion in the heterostructure. This pure
valley diffusion current will be unidirectional if holes are excited
at one edge of the device. (E to H) Spatial-temporal imaging of the pure
spin-valley current. (E) Elliptically shaped pump and probe beam
profiles, each with a ~1.5-μm half width. The long axis of the elliptical
light (>50 μm) is much longer than the sample width (~10 μm). Therefore,
the illumination is nearly homogeneous along the long axis direction.
(F) The spatial-temporal evolution of spin- and valley-polarized holes in the
heterostructure with an initial hole doping p₀ = 1 × 10¹²/cm². The horizontal
and vertical axes represent the temporal and spatial separation
between the pump and probe pulses, respectively. The colors represent the
amplitude of the circular dichroism signal on a logarithmic scale, which
is proportional to the valley-polarized hole density. The fast decrease in
signal at small Δx values and the increase in signal at large Δx values
signify the propagation of a valley diffusion current, which can be well
captured by simulation from a diffusion-decay model (G). (H) Horizontal
cuts from (F) (symbols) and fits from the diffusion-decay model (curves)
over a longer time scale. The whole region within Δx = 8 μm becomes
homogeneous after 2 μs because of the diffusive valley current and shows
an overall slow decay afterward.

Fig. 3. Strong modulation of spin-valley current with electrostatic
gating. (A and C) Spatial-temporal imaging of spin-valley current at
initial hole concentrations of near zero (A) and 2.8 × 10¹²/cm² (C).
(B and D) Corresponding simulation results from the diffusion-decay
model. The valley-polarized holes have negligible diffusion near the charge-

We can compare the measured spatial-temporal
image of the valley-polarized hole density to diffusion
theory and extract important dynamic
transport parameters. The spatial-temporal evolution
of the one-dimensional valley-polarized
hole density Δp_v(x, t) is described by a simple
diffusion-decay model (24)

$$\Delta p_v(x,t) = \frac{\Delta p_0}{\sqrt{\pi(\sigma_0^2 + 4Dt)}}\; e^{-x^2/(\sigma_0^2 + 4Dt)}\; e^{-t/\tau} \qquad (1)$$

where Δp₀ is the total number of pump-induced
spin- and valley-polarized holes, σ₀ is the half
width of the pump beam, D is the hole diffusion
constant, and τ is the lifetime of valley-polarized
holes (limited by a finite intervalley scattering
time). The pump-probe signal S(Δx, Δt) can be
calculated by convolving Δp_v(x, Δt) with the
probe beam intensity I(x) at a given pump-probe
separation Δx (24). Figure 2G shows the simulated
pump-probe signal with a hole diffusion
constant of D = 0.2 cm²/s and a valley lifetime of
τ = 20 μs. The excellent agreement between the
experimental and theoretical results confirms the
diffusion-dominated dynamics of valley-polarized
holes. Figure 2H shows several horizontal line
cuts of Fig. 2F over a longer time scale (different
curves are shifted vertically and successively by
40% of the major tick separation for visual clarity;
fig. S2 illustrates vertical line cuts). We observed
that the whole region within Δx = 8 μm
became homogeneous after 2 μs because of the
diffusive valley current and exhibited an overall
slow decay afterward. This dynamic is well captured
by the diffusion-decay model in Eq. 1 (curves
in Fig. 2H). The valley lifetime of τ = 20 μs extracted
here is more than one order of magnitude
longer than the earlier reported values (14-18).
From the valley hole diffusion constant and valley
polarization lifetime, we obtain a valley diffusion
length of l = √(Dτ) = 20 μm. In addition,
the optical generation can produce a very high
valley current density, reaching 2 × 10⁷ A/m² in the
~1-nm-thick heterostructure at a valley-polarized
hole density of 10¹²/cm² (24).
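
As a rough numerical companion to the diffusion-decay model, the sketch below evaluates a Gaussian profile that broadens as σ₀² + 4Dt and decays as exp(−t/τ), convolves it with a probe profile, and computes the diffusion length √(Dτ). The Gaussian probe shape, its 1.5-μm half width, the integration grid, and all function names are assumptions made for illustration; the authors' actual fitting procedure is described in ref. 24.

```python
import numpy as np

CM2_PER_S_TO_UM2_PER_US = 100.0  # 1 cm^2/s = 1e8 um^2 per 1e6 us


def valley_hole_density(x_um, t_us, dp0=1.0, sigma0_um=1.5,
                        d_cm2_s=0.2, tau_us=20.0):
    """Diffusion-decay profile: a Gaussian whose squared width grows as
    sigma0^2 + 4*D*t while its area decays as exp(-t/tau)."""
    d = d_cm2_s * CM2_PER_S_TO_UM2_PER_US          # um^2 / us
    w2 = sigma0_um ** 2 + 4.0 * d * t_us           # squared width, um^2
    x = np.asarray(x_um, dtype=float)
    return (dp0 / np.sqrt(np.pi * w2)
            * np.exp(-x ** 2 / w2) * np.exp(-t_us / tau_us))


def pump_probe_signal(dx_um, dt_us, probe_sigma_um=1.5, **model):
    """Overlap of the hole density with a normalized Gaussian probe
    centered at the pump-probe separation dx_um (assumed probe shape)."""
    x = np.linspace(-200.0, 200.0, 40001)          # 0.01 um grid
    step = x[1] - x[0]
    probe = (np.exp(-(x - dx_um) ** 2 / probe_sigma_um ** 2)
             / np.sqrt(np.pi * probe_sigma_um ** 2))
    return float(np.sum(valley_hole_density(x, dt_us, **model) * probe) * step)


def diffusion_length_um(d_cm2_s=0.2, tau_us=20.0):
    """Valley diffusion length l = sqrt(D * tau), in micrometers."""
    return float(np.sqrt(d_cm2_s * CM2_PER_S_TO_UM2_PER_US * tau_us))


signal_late = pump_probe_signal(8.0, 0.8)   # 8 um away, 800 ns after the pump
signal_early = pump_probe_signal(8.0, 0.0)  # same position at zero delay
length = diffusion_length_um()              # uses D = 0.2 cm^2/s, tau = 20 us
```

With D = 0.2 cm²/s the squared width grows by 4Dt = 64 μm² in 800 ns, which is consistent with the signal reaching separations of about 8 μm on that time scale.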

The photo-generated valley diffusion current
can be modulated substantially through electrostatic
gating. Figure 3, A and C, shows the spatial-temporal
mapping of the valley-polarized hole
density with the initial hole doping at ~0 and
2.8 × 10¹²/cm², respectively. We found that the
valley-polarized hole density decays over time
but maintains the same spatial profile for near-zero
initial doping (Fig. 3A). This indicates that
at the low-doping limit valley-polarized holes are
largely localized and there is no valley current.
In contrast, the valley diffusion current flow becomes
quite large for the highly hole-doped heterostructure
at 2.8 × 10¹²/cm² (Fig. 3C), leading
to a peak spin-valley diffusion current density of
1.2 × 10⁸ A/m² at a valley-polarized hole density
of 10¹²/cm² (24).

Figure 3, B and D, shows theoretical modeling
of the valley hole diffusion at p₀ = 0 and
2.8 × 10¹²/cm², respectively. The extracted diffusion
constants are summarized in Fig. 3E. From
the diffusion constant, we can directly obtain
the density-dependent hole mobility using the
Einstein relation for a Fermi liquid at 10 K (24).
The holes are almost completely localized at a
doping level close to the charge neutrality point.
Therefore, both the diffusion constant and the
mobility are negligibly small. At larger carrier
concentrations, the hole diffusion constant and
the mobility both increase substantially, reaching
D = 1.2 cm²/s and μ = 160 cm²/(V·s) at p₀ =
2.8 × 10¹²/cm². The localization of holes at low
carrier density signifies a mobility edge in TMDC
materials, which may be related to Anderson localization
from potential fluctuations (31). Traditional
low-temperature transport measurements
at such low carrier concentrations are quite challenging
or almost impossible because of an exponentially
large contact resistance. In this study the
spatially and temporally resolved optical imaging
provided a contact-free approach to measure the
intrinsic charge transport in TMDCs, and it can
be a powerful and general tool for characterizing
the transport behavior of two-dimensional systems
at low carrier concentrations.

Like the hole diffusion constant, the valley-polarized
hole lifetime also depends strongly
on the electrostatic gating. Notably, the valley-polarized
hole population (Δp_v = Δp_K − Δp_K′),
where Δp_K and Δp_K′ are photo-generated excess
hole densities in the K and K′ valleys of WSe2,
respectively, can exhibit very different behavior
from the total excess hole population (Δp_tot =
Δp_K + Δp_K′). The different dynamic behaviors
of Δp_tot and Δp_v can be probed from the sum
and difference responses of left and right circularly
polarized probe light, respectively (18).
Figure 4A shows the population decay of the total
excess holes in the heterostructure at Δx = 0
for different initial carrier concentrations, with
the decay constants summarized in Fig. 4B (triangles).
The decay lifetime of Δp_tot ranges from
20 to 500 ns and becomes shorter at both greater
electron doping and greater hole doping of the
heterostructures. In contrast, the valley-polarized
hole lifetime (circles in Fig. 4B) is the same as
the total hole population lifetime for charge-neutral
and electron-doped heterostructures but
decouples and becomes much longer for hole-doped
cases.

The unusual dynamics of doping-dependent
valley lifetime arises from the distinctive interlayer
electron-hole recombination process in the
heterostructure, as illustrated in Fig. 4, C and D
(24). For electron-doped or charge-neutral heterostructures
(Fig. 4C), all of the holes in WSe2
are pump-generated "excess holes." Therefore,
when Δp_tot decays to zero because of interlayer
electron-hole recombination, no holes (and certainly
no valley-polarized holes) remain in the
WSe2. In these cases, the valley lifetime is limited
by the total excess hole lifetime. The scenario
is completely different for hole-doped heterostructures
(Fig. 4D). If the original hole density
is much larger than the photo-generated one,
neutral point (A and B). In contrast, holes diffuse efficiently at large
hole doping (C and D), creating a large spin-valley current density
up to 1.2 × 10⁸ A/m² at an optical excitation density of 10¹²/cm².
(E) Extracted hole diffusion constant (red) and mobility (blue) show a
strong dependence on the initial hole concentration.

Fig. 4. Doping-dependent lifetime of total excess hole and valley-polarized
hole populations. (A) Decay dynamics of the total excess hole
density at different initial carrier concentrations. The decay lifetime is
longest near charge neutrality and decreases with both electron doping
(positive carrier concentration) and hole doping (negative carrier concentration).
(B) Comparison between valley-polarized hole lifetime (circles)
and total excess hole lifetime (triangles). The valley lifetime is limited by
the total population lifetime in charge-neutral and electron-doped heterostructures;
in hole-doped heterostructures the valley lifetime can be orders
Long-range exciton transport in
conjugated polymer nanofibers
prepared by seeded growth


Xu-Hui Jin,1* Michael B. Price,2* John R. Finnegan,1 Charlotte E. Boott,1
Johannes M. Richter,2 Akshay Rao,2 S. Matthew Menke,2 Richard H. Friend,2†
George R. Whittell,1† Ian Manners1†


Easily processed materials with the ability to transport excitons over length scales of more
than 100 nanometers are highly desirable for a range of light-harvesting and optoelectronic
devices. We describe the preparation of organic semiconducting nanofibers comprising
a crystalline poly(di-n-hexylfluorene) core and a solvated, segmented corona consisting of
polyethylene glycol in the center and polythiophene at the ends. These nanofibers exhibit
exciton transfer from the core to the lower-energy polythiophene coronas in the end
blocks, which occurs in the direction of the interchain π-π stacking with very long diffusion
lengths (>200 nanometers) and a large diffusion coefficient (0.5 square centimeters per
second). This is made possible by the uniform exciton energetic landscape created by the
well-ordered, crystalline nanofiber core.


The ability to transport excitation energy
over length scales comparable to the optical
absorption depth (100 nm and beyond)
is central to the function of a range of devices,
including solar cells. Most thin-film
organic semiconductor structures, such as those
formed from conjugated polymers, show short
exciton diffusion lengths (L_D) of around 10 nm (1)
that are primarily constrained by energetic disorder.
As a result, these are fabricated as blends
of electron donor and acceptor materials with
length scales for the partly demixed materials
targeted around 10 nm, so that all excitons can
reach the charge-generating hetero-interface
(1, 2). In contrast, diffusion ranges for singlet
excitons in purified single crystals are known to
be considerably larger (up to 220 nm) (1, 3-6).
However, in these cases the materials are generally
polydisperse in dimensions and are problematic
to incorporate into useful devices. Device
development is thus dependent on the ability
to develop uniform nanostructures that are amenable
to processing and able to support long-range
exciton diffusion.

Self-assembly of molecular and polymeric
amphiphiles in solution has recently emerged as
a promising route to core-corona nanoparticles
(micelles) suitable for device applications. For
instance, long-range exciton diffusion has been
observed in dye molecule H and J fiber-like aggregates
(where H and J characterize a shift to
shorter or longer wavelength upon aggregation,
respectively) by single-molecule spectroscopy (7, 8)
and in more detailed investigations performed on
single conjugated polymers isolated in matrices
or dilute solution (9, 10). These latter studies,
however, necessarily exclude the role of interchain
transport in exciton diffusion. Such transport is
possible in solution-processed bulk conjugated
polymers, which have been reported to show a
singlet exciton diffusion length of 70 nm (11), but
this is still short of the lengths exhibited in molecular
crystals. Here, we report exceptional exciton
diffusion ranges in uniform nanofibers formed
from the seeded-solution self-assembly of block
copolymers with a crystallizable π-conjugated
poly(di-n-hexylfluorene) (PDHF) block.

To facilitate the study of exciton diffusion
within π-conjugated crystalline PDHF, we prepared
segmented nanofibers comprising a continuous
PDHF core with a discrete region of
energy-accepting quaternized polythiophene
(QPT) covalently attached as a corona at each
end (Fig. 1, A and B). The central segment contained
an electronically insulating poly(ethylene
glycol) (PEG) corona, and the length of this region
was varied to afford a size series of nanofibers.
These segmented nanostructures were
prepared using living crystallization-driven
self-assembly, a recently developed seeded-growth
method for producing one-dimensional (1D) and
2D objects of controlled dimensions (12, 13). The
resulting structures are generally typified by
highly crystalline cores, which can comprise a
number of building blocks (14-17). We, Faul, and
co-workers have previously demonstrated the use
of this method with polythiophene-containing
block copolymers to yield fibers with a π-conjugated
core, which form ensembles with promising transport
properties but weak fluorescence due to
aggregation-induced quenching (18).

PDHF exhibits bright blue fluorescence in the
solid state (19), in electrospun microfibers (20),
and in polydisperse nanofibers self-assembled
from triblock copolymers in solution (21). The
uniform examples used in the present study,
however, were prepared in a multistep approach,
so as to achieve the desired segmented coronal
structure and varied length of the PEG component.
Initially, nanofibers ~5 to 10 μm in length,
derived from the block copolymer PDHF14-b-PEG227
(where the numbers refer to the number-average
degree of polymerization of each block;
see figs. S1, S3 to S5, and S10), were prepared by
homogeneous nucleation in a 10:8 mixture of
tetrahydrofuran and methanol (THF:MeOH).
Analysis by transmission electron microscopy
(TEM), atomic force microscopy (AFM), and wide-angle
x-ray scattering (WAXS) revealed the presence
of a crystalline PDHF core with a rectangular
cross section (number-average width W_n = 12.9 nm,
height = 4.5 nm) surrounded by a PEG corona
(figs. S10 to S12). A solution of micelle seeds
(number-average length L_n = 30 nm; polydispersity
L_w/L_n = 1.03, where L_w is the weight-average
length; height = 4.5 nm) was then prepared by
sonication of the multimicrometer-long fibers
(figs. S13, A and B, and S14A). Subsequent addition
of different volumes of a solution of unimeric
(molecularly dissolved) PDHF14-b-PEG227
copolymer in THF led to the formation of uniform
nanofibers of controlled length (Fig. 1A and
figs. S14 and S15).
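
The length polydispersity L_w/L_n quoted above can be computed from a set of measured contour lengths using the usual number-average and weight-average definitions, L_n = ΣL_i/N and L_w = ΣL_i²/ΣL_i. The sketch below assumes those definitions; the list of lengths is hypothetical and is not data from this work.

```python
def number_average(lengths):
    """L_n: plain arithmetic mean of the measured lengths."""
    return sum(lengths) / len(lengths)


def weight_average(lengths):
    """L_w: mean weighted by length, so long fibers count more."""
    return sum(length * length for length in lengths) / sum(lengths)


def length_dispersity(lengths):
    """L_w / L_n; equals 1 for perfectly uniform fibers."""
    return weight_average(lengths) / number_average(lengths)


# Hypothetical seed lengths in nm, e.g. as traced from TEM images.
seed_lengths_nm = [25.0, 28.0, 30.0, 32.0, 35.0]
l_n = number_average(seed_lengths_nm)
dispersity = length_dispersity(seed_lengths_nm)
```

A value close to 1, such as the 1.03 reported for the seeds, indicates a narrow length distribution.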

To create nanofibers with a segmented corona,
we prepared an all-π-conjugated donor-acceptor
diblock copolymer, PDHF14-b-QPT22, comprising
the same PDHF core-forming block and a QPT
corona-forming block (figs. S2 and S6 to S9).
This material was added in a molecularly dissolved
unimeric state (in THF:MeOH 3:1) to the
PDHF14-b-PEG227 nanofibers in THF:MeOH 1:1,
leading to growth from the two PDHF core termini.
The resulting uniform nanofibers had a B-A-B
structure of controlled overall and segment length
with a crystalline PDHF core present over the
entire length, but with a π-conjugated corona-forming
block located only on the terminal (B)
segments (Fig. 1A). This was used to produce a
size series of near-uniform nanofibers with L_n =
180 ± 40 nm, 300 ± 70 nm, 505 ± 100 nm, 945 ±
240 nm, and 1840 ± 540 nm, with each QPT segment
comprising 35 to 120 nm of this length, as
measured by TEM and supported by AFM (figs.
S13, C and D, S16, and S17 and table S1). The dimensional
control is illustrated further in Fig. 2A
(and fig. S18), which shows a laser scanning
confocal microscopy (LSCM) image of model
nanofibers with central A-segments of 1.6 μm
and QPT-corona-containing segments of 1.8 μm
in length.

Figure 1C shows the absorption and photoluminescence
(PL) spectra of unsegmented
PDHF14-b-PEG227 nanofibers, the absorption
spectrum of the QPT homopolymer (see fig. S19
for emission spectrum), and the energy levels
of the two conjugated species. The overlap of
absorption and emission in the PDHF indicates
a small Stokes shift, whereas the PL illustrates
a high degree of vibronic structure. The good
overlap of QPT absorption and PDHF emission
gives a PDHF-to-QPT Förster transfer radius
of 4 nm.
Fig. 1. Formation of segmented PDHF nanofibers by multistep self-assembly. (A) Schematic
diagram illustrating the seeded growth process and the structures of PDHF14-b-PEG227 and
PDHF14-b-QPT22. (B) Illustration of the segmented B-A-B nanofiber structure with separate donor
and acceptor domains. (C) Normalized absorption of QPT homopolymer in THF:MeOH (1:1) (orange
dashes), and unsegmented PDHF nanofibers (L_n = 1605 nm) (blue dashes), and photoluminescence
(PL) emission of unsegmented PDHF nanofibers (L_n = 1605 nm) in the same solution (solid blue
line). The I_0-0 and I_0-1 peaks in the PL are at 425 nm and 455 nm, respectively. The inset shows the
energy levels of the PDHF and QPT.

Fig. 2. Photoluminescence of segmented PDHF B-A-B nanofibers in solution. (A) LSCM image
of the uniform segmented PDHF nanofibers with a crystalline PDHF core (blue emission) and two
terminal segments with QPT coronas (orange emission in THF:MeOH 1:1). Values of L_n for the
central and terminal segments were 1.6 μm and 0.9 μm, respectively. (B) PL spectra of segmented
PDHF nanofibers with different A-segment lengths, normalized to peak maxima. In each case,
the solutions of segmented nanofibers (~0.5 mg/ml) in THF:MeOH 1:1 were excited at the PDHF
absorption peak of 380 nm. Emission arising from direct excitation of the QPT in the 1605-nm
sample was unresolved.

We performed further steady-state optical measurements
on the PDHF14-b-PEG227 as both nanofiber
(of average length 435 nm) and unimer to
probe the nature of exciton transport in the PDHF
core. The nanofibers show more vibrational structure
(Fig. 1C) than the unimer (fig. S20), and there
is no pronounced red or blue shift upon aggregation.
This is consistent with a π-stacked
polymer possessing a transition dipole along the
polymer backbone (22), which would be expected
to exhibit both H- and J-like aggregate characteristics
(23, 24). The ratio of the I_0-0 to I_0-1 PL
bands is larger in the nanofiber than in the unimer
(fig. S20), indicating a decreased Huang-Rhys
parameter and hence a smaller configurational
relaxation in the excited state (19).
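
The qualitative link between the PL band ratio and the Huang-Rhys parameter S can be sketched with the textbook single-mode Franck-Condon progression, in which the 0-n band carries a Poisson weight exp(−S)·Sⁿ/n!, so that I_0-0/I_0-1 = 1/S. This is a simplified picture assumed here for illustration only: it ignores aggregate (H- or J-type) coupling, which also changes the 0-0 to 0-1 ratio, as well as frequency-dependent emission factors. The numerical ratios below are hypothetical and are not taken from fig. S20.

```python
from math import exp, factorial


def vibronic_weights(s, n_max=4):
    """Relative intensities of the 0-0, 0-1, ... bands for a single
    displaced harmonic mode with Huang-Rhys parameter s."""
    return [exp(-s) * s ** n / factorial(n) for n in range(n_max + 1)]


def huang_rhys_from_ratio(i00_over_i01):
    """Invert I_0-0 / I_0-1 = 1 / S within the single-mode picture."""
    return 1.0 / i00_over_i01


weights = vibronic_weights(0.5)
ratio_check = weights[0] / weights[1]        # equals 1 / 0.5 in this model
s_unimer = huang_rhys_from_ratio(1.2)        # hypothetical unimer ratio
s_nanofiber = huang_rhys_from_ratio(2.0)     # hypothetical nanofiber ratio
```

Within this picture a larger I_0-0 to I_0-1 ratio maps directly onto a smaller S, which is the direction of the trend described for the nanofibers.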

We studied the energy transfer from PDHF
to QPT for the aforementioned size series of
segmented nanofibers; Fig. 2B shows the normalized
PL when the PDHF is selectively excited
(at 380 nm). We observed quenching of the PDHF
peak relative to the QPT peak for average A-segment
lengths below 775 ± 150 nm. This indicates
energy transfer from the A-segment PDHF
core to the B-segment QPT corona, although
reabsorption of the PDHF PL is expected at this
concentration. Further evidence for energy transfer
comes from the excitation PL scan (fig. S21),
which maps the absorption (fig. S22), and from
the change in PL quantum efficiency with A-segment
length (fig. S23). The latter matches
that of pure QPT (13 ± 5%) below the critical
length (775 nm) and approaches that for the unsegmented
structure (73 ± 10%, a very high value)
beyond this length.

Time-resolved PL measurements enable better 
quantification of energy transfer, because donor 
quenching kinetics are not affected by reabsorp- 
tion effects. We used time-correlated single pho- 
ton counting (TCSPC; instrument response time 
~300 ps) to access low-excitation density regimes, 
and transient grating PL spectroscopy to probe 
the 1- to 100-ps time scale (see supplementary 
materials). 

From TCSPC, the predominant natural lifetime
of the unsegmented PDHF14-b-PEG227 nanofibers
(with average length 435 nm) in solution
is 430 ps (fig. S24), shorter than for the cor- 
responding unimer (700 ps), as expected for a 
J-aggregate. Figure 3A shows the transient grat- 
ing PL time slices of the segmented nanofiber 
with an A-segment core length of 775 nm. We 
see a reduction of the PDHF emission (~480 nm) 
and a concurrent rise in the broader QPT emis- 
sion at longer wavelengths (530 to 630 nm). 
Figure 3B shows the kinetics of this transfer. We 
note that the QPT emission must be integrated 
over a broad wavelength range to account for the 
slow energy transfer that excitations undergo 
within the QPT itself (fig. S25). 

We modeled the kinetics of the PDHF and
QPT PL with a 1D diffusion model (25) that
includes a contribution from Förster resonance
energy transfer at the ends of the PDHF nanofiber
into the QPT (see supplementary materials).
For the TCSPC data (fig. S26) on the 775-nm
A-segment length nanofiber solution, the best
fit gives a diffusion length of L_D = 210 ± 100 nm
and a diffusion constant of D = L_D²/(2τ) = 0.5 ±
0.2 cm² s⁻¹, with errors estimated from the polydispersity
of the nanofiber solution and robustness
of the fit. Such an exciton diffusion constant is
higher than any currently reported for organic
semiconductors (1).
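
The relation D = L_D²/(2τ) can be checked with a short calculation. The sketch below assumes that τ is the 430-ps natural lifetime quoted for the unsegmented nanofibers; the text does not state explicitly which lifetime enters the fit, so this pairing is an inference, and the function names are illustrative.

```python
from math import sqrt


def diffusion_constant_cm2_s(l_d_nm, tau_ps):
    """1D relation D = L_D^2 / (2 * tau), returned in cm^2/s."""
    l_d_cm = l_d_nm * 1e-7      # nm -> cm
    tau_s = tau_ps * 1e-12      # ps -> s
    return l_d_cm ** 2 / (2.0 * tau_s)


def diffusion_length_nm(d_cm2_s, tau_ps):
    """Inverse relation L_D = sqrt(2 * D * tau), returned in nm."""
    return sqrt(2.0 * d_cm2_s * tau_ps * 1e-12) * 1e7


d_best_fit = diffusion_constant_cm2_s(210.0, 430.0)   # about 0.51 cm^2/s
l_d_back = diffusion_length_nm(d_best_fit, 430.0)     # recovers 210 nm
```

With L_D = 210 nm and τ = 430 ps this gives roughly 0.51 cm² s⁻¹, in line with the quoted 0.5 ± 0.2 cm² s⁻¹.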

Figure 4, A and B, shows the transient grating 
PL kinetics for the PDHF and QPT for a range 
of shorter segmented nanofibers as a function 
of A-segment length. We see faster quenching 
of PDHF emission and faster rises in the QPT 
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emission as the A-segment length is decreased. To 
fit the transient grating data, and hence a larger 
sample of quenching lengths than are available 
to TCSPC measurements, we add a time-dependent 
exciton-exciton annihilation term to the diffusion 
model (see supplementary materials and figs. 
$27 and $28) to account for second-order decay 
at the higher excitation densities used in this 
measurement (between ~5 x 10” and ~10 x 
10” cm”). Using a global fit over multiple 
PDHF,-b-PEGo57 segment lengths and fluences 
(fig. S29), we obtain a best fit to our data for an 
exciton diffusion length of 380 nm. A residual 
analysis (fig. S30) confirms the robustness of 
these values for L_D greater than ~150 nm and less 
than ~600 nm. This agrees within error with the 
TCSPC result, adding support to our initial ob- 
servation of large diffusion coefficients. We 
consider that the diffusion length most likely 
falls toward the smaller end of these values, 
because of the possibility of a small amount of 
exciton-charge annihilation in the higher fluence 
measurements, as well as the potential for a 
small amount of intermolecular energy transfer 
(fig. S31). Nonetheless, these results taken to- 
gether indicate an exceptional diffusion constant 
for a conjugated polymer structure. 
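
The trend of faster PDHF quenching for shorter A-segments follows from diffusion to quenching sites at the segment ends. The sketch below is a deliberately simplified illustration and not the model used for the fits: it assumes uniform initial excitation, perfectly absorbing segment ends in place of the Förster transfer step, and no exciton-exciton annihilation. The 441-ps lifetime and the shorter segment lengths are hypothetical inputs chosen only for illustration.

```python
import math


def survival_fraction(t_ps, length_nm, d_cm2_s, tau_ps, n_terms=200):
    """Fraction of excitons left at time t in a 1D segment of given length.

    Series solution of the 1D diffusion equation for a uniform initial
    distribution with absorbing boundaries at both ends, multiplied by
    natural decay with lifetime tau.
    """
    d_nm2_ps = d_cm2_s * 100.0  # 1 cm^2/s = 1e14 nm^2 / 1e12 ps = 100 nm^2/ps
    total = 0.0
    for k in range(n_terms):
        n = 2 * k + 1  # only odd modes overlap with a uniform distribution
        rate = d_nm2_ps * (n * math.pi / length_nm) ** 2
        total += 8.0 / (n * math.pi) ** 2 * math.exp(-rate * t_ps)
    return total * math.exp(-t_ps / tau_ps)


# Compare the 775-nm segment with two hypothetical shorter segments at 10 ps.
for length in (775.0, 300.0, 100.0):
    s = survival_fraction(10.0, length, d_cm2_s=0.5, tau_ps=441.0)
    print(f"L = {length:5.0f} nm: surviving fraction at 10 ps = {s:.3f}")
```

In this idealized picture the surviving fraction at a fixed delay falls monotonically as the segment is shortened, which is the behavior the fitted model captures quantitatively.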

In conjugated polymers, efficient exciton dif- 
fusion is correlated with an increased degree of 
structural order (4). Our results are consistent 
with this, as corroborated by the WAXS data 
(fig. S12), which show appreciable structural order 
in the unsegmented PDHF₁₄-b-PEG₂₂₇ nanofibers. 
This is supported by the pronounced vibronic 
structure and narrow spectral linewidths in the 
PL. Ultrafast transient grating PL measure- 
ments show evidence of both excitonic move- 
ment (during the instrument response time, 
~200 fs) and some small degree of localization 
[in the first 700 fs after photoexcitation (Fig. 4C 
and fig. S32)]. We believe that this arises from 
rapid migration of excitons from the disordered 
to the ordered regions (evident from the WAXS 
data) and that exciton diffusion then occurs with- 
in these ordered regions in the nanofiber core. 

Further understanding can be gained from 
the self-Förster radius, which is a useful tool for 
quantifying exciton diffusion lengths (26, 27), 
although the description has limitations (28, 29). 
We calculate a self-Förster radius in our PDHF 
fibers of 2.5 ± 0.2 nm (see supplementary materials). 
This is large in part because the parallel 
alignment of polymer chains gives ideal dipolar 
orientations for energy transfer (we have set κ², 
our dipole orientation factor, to 1). From this 
radius we can calculate (27) a diffusion length, 
L_D = R_0³/d² = 75 ± 15 nm, where d is the nearest-neighbor 
distance of 0.46 nm (fig. S12). Applying 
a Förster theory beyond the point-dipole approximation 
(29) would likely reduce this number 
slightly; however, this gives an estimate of ex- 
pected exciton diffusion length for an incoherent, 
nearest-neighbor hopping regime. The fact that 
we measure diffusion lengths beyond this value 
is clear evidence that excitons are not hindered 
by the presence of localization sites, nor by local 
energy minima (30) in their transport. This provides 
evidence of a remarkably uniform energy 
landscape in these materials, enabled by their 
structural order. This uniformity would also 
benefit transport properties through a narrowing 
of the excitonic density of states (31). 
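
The hopping estimate above is a one-line calculation and can be reproduced as follows. This is an illustrative sketch that takes the relation as L_D = R_0³/d² with the quoted self-Förster radius and nearest-neighbor distance; the function name is hypothetical, and the bounds simply propagate the ±0.2 nm on the radius.

```python
def forster_hopping_length_nm(r0_nm: float, d_nm: float) -> float:
    """Nearest-neighbor hopping estimate of the diffusion length, R0^3 / d^2."""
    return r0_nm ** 3 / d_nm ** 2


R0_NM = 2.5      # self-Foerster radius quoted in the text
R0_ERR_NM = 0.2  # quoted uncertainty on the radius
D_NM = 0.46      # nearest-neighbor distance quoted in the text

central = forster_hopping_length_nm(R0_NM, D_NM)
lower = forster_hopping_length_nm(R0_NM - R0_ERR_NM, D_NM)
upper = forster_hopping_length_nm(R0_NM + R0_ERR_NM, D_NM)
print(f"L_D ~ {central:.0f} nm (range {lower:.0f} to {upper:.0f} nm)")
```

The central value lies close to the quoted 75 nm and well below the measured diffusion lengths, which is the comparison the argument relies on.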

Our measurement of a diffusion length greater 
than our estimate for nearest-neighbor Förster 
transport implies that we must look beyond this 
picture to explain our data. An example of a 
model that includes some degree of coherence 
but reduces to diffusive Förster transport as a 
limiting case is that proposed by Barford and 
Duffy (31). Their model uses a modified Redfield 
equation to show that a degree of interchain 
coherence in a polyfluorene film leads to an in- 
crease in the mean exciton hopping range, and 
hence gives a larger diffusion length. This requires 
interchain overlap, and the factor of 2.5 
increase needed to account for our measured 
diffusion range gives a physically reasonable en- 
ergy transfer integral of around 20 meV. There- 
fore, our measured exciton diffusion rate is high 
because of a combination of a physical packing 
structure optimizing the self-Förster radius, a lack 
of energetic trap sites, and a coherent component 
to exciton motion. 

Fig. 3. Transient grating PL spectra and kinetics of segmented PDHF B-A-B nanofibers. 
(A) Transient grating PL time slices of a segmented nanofiber solution (0.5 mg/ml) with an average 
A-segment length of 775 nm, showing energy transfer from the PDHF to the QPT acceptor corona. 
There is a decay of the core PDHF I₀₋₁ peak (which appears at 480 nm because of a filter cutting 
off the blue edge of the spectrum) due to exciton annihilation and quenching to the acceptor. This is 
accompanied by a concurrent rise in the broad QPT PL peak from 550 to 630 nm in the first tens 
of picoseconds (materials are excited with a 200-fs laser pulse at 400 nm with an equivalent 
excitation density of ~5 × 10¹⁷ cm⁻³). (B) Normalized PL kinetics of PDHF decay and rise of QPT 
signal for the spectra shown in (A). The green line shows the PDHF signal (integrated from 430 to 
460 nm); the blue line shows the QPT PL (integrated from 530 nm to 630 nm). The solvent used 
was THF:MeOH 1:1. 

Fig. 4. Size-dependent transient PL kinetics and corresponding diffusion length model fits. 
(A) Transient grating PL kinetics (squares) of PDHF PL signal (integrated from 430 to 460 nm) in 
segmented PDHF B-A-B nanofibers of different A-segment lengths. The PL decay time decreases 
with decreasing segment length, showing efficient transfer. Solid lines are example fits of a 1D 
diffusion model with a diffusion length of L_D = 340 nm. The system was excited with a 400-nm, 
200-fs laser pulse, at ~10¹⁸ cm⁻³ equivalent excitation densities for the samples in solution. (B) The 
corresponding PL kinetics (squares) of the rise in the QPT signal in the segmented PDHF B-A-B 
nanofibers, fitted with the same 1D exciton diffusion model (solid lines) and diffusion length in the 
PDHF of L_D = 340 nm. (C) Transient grating spectra of PDHF-b-PEG nanofibers in solution. Time 
slices show spectra at early times after excitation, with excitation density of ~5 × 10¹⁷ cm⁻³. Spectral 
red-shifting is present until ~200 fs, and the ratio of the first and second vibronic peaks continues 
to reduce until ~700 fs (fig. S32). The solvent used was THF:MeOH 1:1. 

We have synthesized segmented nanofibers 
of controlled length with a PDHF core and containing 
a QPT corona at each terminus. Spectroscopic 
measurements in solution show that these 
nanostructures exhibit long-range exciton transport 
on the critical length scale comparable to 
the optical absorption length in conjugated polymers, 
and that this is enabled by the high degree 
of structural order in the PDHF core. In context, 
a 200-nm-thick film (the depth of our diffusion 
length) of conjugated polymer of average absorption 
coefficient (32) would absorb 98% of incoming 
photons. Such diffusion lengths could 
enable light-harvesting devices that use these 
polymer structures as antennae coupled to photodetector 
materials of limited absorption [such as 
monolayer transition metal dichalcogenides (33)] 
and would also enable much simpler bilayer design 
of organic photovoltaics relative to those 
based on the bulk heterojunction. 
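
The statement that a 200-nm film absorbs 98% of incoming photons can be related to an absorption coefficient through the Beer-Lambert law. The sketch below is illustrative only: it assumes a single pass through the film with no reflection or interference, and it back-calculates the coefficient implied by the quoted numbers rather than using the value from the cited reference.

```python
import math


def absorbed_fraction(alpha_per_cm: float, thickness_nm: float) -> float:
    """Single-pass Beer-Lambert absorption, 1 - exp(-alpha * thickness)."""
    return 1.0 - math.exp(-alpha_per_cm * thickness_nm * 1e-7)


def alpha_for_fraction(fraction: float, thickness_nm: float) -> float:
    """Absorption coefficient (cm^-1) needed to absorb `fraction` in the film."""
    return -math.log(1.0 - fraction) / (thickness_nm * 1e-7)


alpha = alpha_for_fraction(0.98, 200.0)
print(f"implied absorption coefficient: {alpha:.2e} cm^-1")
print(f"absorbed in a 100-nm film: {absorbed_fraction(alpha, 100.0):.2f}")
```

Under these assumptions the implied coefficient is of order 10⁵ cm⁻¹, so the absorption length is comparable to the measured diffusion length.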
COLD MOLECULE PHYSICS 

Building one molecule from a reservoir of two atoms 

L. R. Liu, J. D. Hood, Y. Yu, J. T. Zhang, N. R. Hutzler, T. Rosenband, K.-K. Ni 

Chemical reactions typically proceed via stochastic encounters between reactants. 
Going beyond this paradigm, we combined exactly two atoms in a single, controlled 
reaction. The experimental apparatus traps two individual laser-cooled atoms [one 
sodium (Na) and one cesium (Cs)] in separate optical tweezers and then merges them 
into one optical dipole trap. Subsequently, photoassociation forms an excited-state 
NaCs molecule. The discovery of previously unseen resonances near the molecular 
dissociation threshold and measurement of collision rates are enabled by the tightly 
trapped ultracold sample of atoms. As laser-cooling and trapping capabilities are 
extended to more elements, the technique will enable the study of more diverse, and 
eventually more complex, molecules in an isolated environment, as well as synthesis of 
designer molecules for qubits. 


Chemical reactions proceed through individual 
collisions between atoms or molecules. 
However, when performed in stochastic 
ensembles, the individual reaction probabilities 
are observed as averages. Crossed 
molecular beams reduce the thermal velocity dis- 
persion to probe elementary reaction processes 
based on single collision events, illuminating 
many aspects of reaction dynamics (1-4). In quan- 
tum degenerate gases, cooled to temperatures 
<1 μK, the quantum motional degrees of freedom 
play a critical role in the reaction (5-7). Comparisons 
of such experimental reaction rates with 
theoretical models currently underpin our under- 
standing of reactions at the most elementary level 
(8-10). 

To further improve the specificity and precision 
of reaction steps (11-13), individual particle 
control is needed, similar to pioneering 
atom-positioning experiments with scanning 
tunneling microscopes (14), but untethered 
from surfaces. By controlling individual par- 
ticles via laser cooling and optical trapping, 
molecules may be constructed atom by atom 
while maintaining specific internal and exter- 
nal quantum states. 

We realized chemistry in the minimum num- 
ber regime, where precisely two atoms are 
brought together to form one molecule with the 
aid of a photon. We achieved this using movable 
optical tweezers, in which individual atoms of 
different elements [in this work, sodium (Na) 
and cesium (Cs)] are isolated, cooled, manipu- 
lated, and eventually combined into a single 
optical tweezer. With exactly two atoms in an 
optical tweezer, we can observe their collisions. 
We can also perform single-molecule spectros- 
copy in the gas phase by optically exciting the 
atom pair on a molecular transition, realizing 
the chemical reaction Na + Cs → NaCs*. Subsequent 
imaging of Na and Cs fluorescence 
distinguishes between four possible experimen- 
tal outcomes: Both, only one, or no atoms are 
detected in the tweezer, the latter indicating that 
a reaction has occurred. We chose NaCs for the 
demonstration because it possesses a large mo- 
lecular fixed-frame dipole moment of 4.6 D (15), 
making it a strong candidate for a molecular 
qubit in a future quantum computing architecture. 
We began by preparing laser-cooled Na and 
Cs atoms at a few hundred microkelvin in over- 
lapped magneto-optical traps (MOTs) in a vac- 
uum chamber (10° Pa). The MOTs serve as cold 
atom reservoirs for loading single atoms into 
tightly focused optical tweezer traps (16). After 
loading, the MOTs are extinguished. A sche- 
matic of the apparatus is shown in Fig. 1A. The 
numerical aperture = 0.55 microscope objec- 
tive focuses two different wavelengths of light, 
700 and 976 nm, to waists of 0.7 and 0.8 μm 
radius. Because of the difference in Na and Cs polarizabilities, 
the 700-nm-wavelength light attracts 
Na and repels Cs, whereas 976-nm-wavelength 
light attracts Cs five times more strongly than it does 
Na (17), enabling us to manipulate Na and Cs 
independently (Fig. 2A). A typical trap depth of 
1 mK is achieved for 5 mW of tweezer power. 
When tightly confined identical atoms are 
illuminated with near-atomic-resonant light, 
light-assisted pairwise collisions result in either 
zero or one final atom in the trap (16, 18). Single-atom 
loading succeeds approximately half of the 
time (19). However, the large light shifts for Na in 
a 700-nm-wavelength tweezer would normally 
prevent atom cooling and, consequently, effi- 
cient atom loading. We eliminated this light 
shift for Na by alternating the tweezer and cooling 
beams at a rate of 3 MHz (20). Subsequently, 
Na followed by Cs were imaged and polarization-gradient 
cooled to 70 and 10 μK, respectively. 
To determine whether an atom is in 
the optical tweezer, the fluorescence photoelec- 
tron counts from each atom in a region of in- 
terest (Fig. 1B) are compared with a threshold 
(Fig. 1C). The fluorescence histograms indicate 
that the cases of zero or one atom can be dis- 
tinguished with a fidelity better than 99.97%. We 
found that in 33% of cases, we loaded a single Na 
and a single Cs atom side by side. In 18% of cases, 
no atoms were loaded, and the rest of the time, 
either a single Na or Cs atom was loaded (Fig. 
1B). The experiment, which repeats at 3 Hz, re- 
cords initial and final fluorescence images in 
order to determine survival probabilities for dif- 
ferent stages of the molecule-formation process. 

Once single atoms have been loaded in sepa- 
rate traps, they need to be transported to the 
same location for molecule formation. Optical 
tweezers have been used to move single atoms 
while maintaining atomic internal state coherence 
(21) and to merge two indistinguishable 
atoms through coherent tunneling into one 
tweezer (22). We adiabatically transported and 
merged two different atoms, Na and Cs, into the 
same tweezer by using optical tweezers at two 
different wavelengths (Fig. 2A). The trap depths 
were adjusted by changing the beam intensities, 
and the positions were steered by applying dif- 
ferent radio frequencies to the respective acousto- 
optic deflectors (AODs) (Fig. 1A). 

For the merge sequence, the 700-nm tweezer 
containing Na was kept stationary, while the 
976-nm tweezer containing Cs was moved to overlap 
the atoms (Fig. 2A, I to III). After the merge, the 
700-nm tweezer was extinguished adiabatically 
so as to leave both atoms in the 976-nm tweezer 
(Fig. 2A, IV). We designed this merge trajectory 
so that (i) Cs is deeply confined at all times and 
(ii) the double-well potential imposed on Na is 
sufficiently asymmetric to avoid a near-degenerate 
ground state. This process is time-reversible, which 
enables us to image the atoms separately and 
determine survival probability. 

Because the 700-nm tweezer is extinguished 
for 1 ms after the merge, while the 976-nm tweezer 
is always active, the Na atom escapes, unless the 
two tweezers are overlapped at the end of the 
merge sequence, whereas the Cs atom is always 
trapped. The result obtained when scanning the 
endpoint of the 976-nm tweezer trajectory is 
shown in Fig. 2B. The height of the Na survival 
peak at 0 μm of 94(1)% is near the reimaging 
survival probability of 96% (numbers in parenthe- 
ses indicate the Wilson score interval). 
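
For readers unfamiliar with it, the Wilson score interval used for these survival probabilities can be computed in a few lines. This is a minimal sketch of the standard formula; the choice z = 1 (roughly a 68% interval) and the example counts are assumptions made for illustration and are not taken from the data.

```python
import math


def wilson_interval(successes: int, trials: int, z: float = 1.0):
    """Wilson score interval for a binomial proportion.

    z is the normal quantile; z = 1.0 gives roughly a 68% interval
    (an assumption here, since the text does not state the level used).
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    p = successes / trials
    denom = 1.0 + z * z / trials
    centre = (p + z * z / (2.0 * trials)) / denom
    half = z * math.sqrt(p * (1.0 - p) / trials + z * z / (4.0 * trials ** 2)) / denom
    return centre - half, centre + half


# Hypothetical counts: 94 survivals in 100 repetitions.
low, high = wilson_interval(94, 100)
print(f"survival = 0.94, Wilson interval = ({low:.3f}, {high:.3f})")
```

Unlike the simple normal approximation, the interval stays within [0, 1] even when the observed survival is close to unity, which is why it suits high-fidelity single-atom measurements.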

Having demonstrated adiabatic transport and 
merging of two species into a tight tweezer, we 
turned to their collisions. Isolated collisions 
between two atoms do not usually result in mol- 
ecule formation because of the need to simul- 
taneously conserve momentum and energy. 
However, the atoms can change their hyper- 
fine states after colliding, and the exothermic 
hyperfine-spin-changing collisions impart enough 
kinetic energy (~100 mK) to the atoms to eject 
them from the tweezer (≈1 mK depth) (23). 

Generally, a given initial trap occupancy can 
evolve into four possible outcomes after an ex- 
periment: (i) both atoms, (ii) no atoms, (iii) only 
Cs, and (iv) only Na remain in the trap. Single- 
atom images from each repetition allow us to 
post-select on any of these cases and separate 
one- and two-body processes, giving both life- 
times from a single data set (Fig. 3). For example, 
when Na and Cs are both present (effective pair 
density of n₂ = 2 × 10¹² cm⁻³) (24), and prepared 
in a mixture of hyperfine spin states, they are 
both rapidly lost: τ = 8(1) ms, where τ is 
the 1/e time of exponential decay. This yields a 
loss rate constant β = 5 × 10⁻¹¹ cm³/s. By contrast, 
if the atoms are both optically pumped into the 
lowest-energy hyperfine levels, conservation of 
energy prevents the change of hyperfine states, 
and the atom lifetime increases to 0.63(1) s, 
which is similar to the rate of hyperfine-state 
relaxation for Cs owing to off-resonant scat- 
tering of the tweezer light (25). When only one 
atom is present, one-body loss due to collisions 
with background gas limits the lifetime to 5 s. 

Because of the rapid two-body loss for mixed 
hyperfine states, we optically pumped each atom 
into its lowest-energy hyperfine state in order to 
maintain a long-lived sample of cotrapped Na and 
Cs atoms. We then performed photoassociation 
(PA) of the atoms in order to form an excited state 
molecule, realizing a single instance of the chemical 
reaction Na + Cs → NaCs*. When illuminating 
the atoms with resonant PA light, an electronically 
excited state molecule may form (Fig. 4A) and then 
rapidly decay to the ground state. The molecule 
does not scatter imaging light, causing molecule 
formation to manifest as simultaneous loss of both 
Na and Cs atoms. These loss resonances are shown 
in Fig. 4B, bottom, as the frequency of the PA light 
is scanned below the dissociation threshold. 

Our optical tweezer architecture offers a num- 
ber of advantages for PA measurements over 
previous methods with bulk samples (26). The 
ability to precisely define the initial reagents 
eliminates contributions from other reaction 
processes such as Cs₂ formation or three-body 
loss. The combination of the high effective pair 
density n₂ = 3 × 10¹² cm⁻³ (24), afforded by the 
tweezer confinement, and high PA light intensity 
of 3 kW/cm² yields fast PA rates. The high-contrast 
measurements of single-atom loss result 
in near-unity molecule detection efficiency and 
avoid the need for ionization detection (26). 

We scanned the 200-MHz frequency-broadened 
PA light from 30 to 250 GHz below the Cs atomic 
D2 line (6S1/2 - 6P3/2). We took steps of 200 MHz 
with 100-ms pulse duration and took an average 
of approximately 100 repetitions at each data point. 
An absolute accuracy of 1 GHz was set by the wave- 
meter. During PA, the Cs atom could be promoted 
into the upper hyperfine level because of off- 
resonant scattering of the PA beam, which would lead 
to spin-changing collisional loss. We counteracted 
this effect by simultaneously optically pumping Cs 
into the lower hyperfine level with a separate beam. 

The ability to detect molecule formation via 
atom loss with high efficiency allowed us to probe 
NaCs* vibrational levels near the dissociation 
threshold, including resonances that have not 
been previously observed (Fig. 4). According to 
ab initio calculations of NaCs* with spin-orbit 
coupling (27), five molecular potentials converge 
to the Cs (6P3/2) + Na (3S1/2) asymptote (Fig. 4A): 
B¹Π₁, c³Σ₁, c³Σ₀, b³Π₂, and b³Π₀. Of these, only the 
c³Σ₁ levels have previously been observed in the 
near-threshold regime (28), and our measurement 
agrees to within 1 GHz. To identify the vibrational 
progressions, we fit the LeRoy-Bernstein 
(LB) dispersion model (29) to our observed resonances. 
Near threshold, the vibrational quantum 
number v' (v' = -1 is the highest bound state) is 
related to the binding energy by 

\[
E_{v'} = \left[ 2\hbar \sqrt{\frac{2\pi}{\mu}} \, \frac{\Gamma(7/6)}{\Gamma(2/3)} \, \frac{v'_0 - v'}{C_6^{1/6}} \right]^{3} \qquad (1)
\]


Fig. 1. Dual-species single-atom trapping and imaging. (A) Schematic of the setup. Optical 
tweezer atom-trapping beams (700 and 976 nm wavelengths) are independently steered by AODs, 
expanded by telescopes, and then combined on a dichroic mirror before being focused by the 
objective into a glass cell. Fluorescence from trapped Na and Cs atoms is collected through the 
objective onto the charge-coupled device camera. (B) Fluorescence images of single Na and 
Cs atoms. Length scale of 1 μm is indicated. Cs (top) and Na (bottom) are imaged sequentially in 
the same field of view. The four possible cases are shown with their initial loading probabilities: 
no atoms, a single Na atom, a single Cs atom, or both Na and Cs atoms. Dashed blue (Cs) 
and orange (Na) boxes indicate the regions of interest for determining the presence of atoms. 
(C) Histogram of Cs (blue) and Na (orange) fluorescence. The bimodal distribution shows clear 
separation between zero- and one-atom peaks. Red dashed lines indicate the threshold that 
is used to determine the presence of an atom. 
Fig. 2. Merging single Na and Cs atoms, which are initially separated by 3 μm, into 
one tweezer. (A) One-dimensional cuts of the combined, time-varying 700- and 976-nm 
tweezer potentials for both atoms during the merge sequence. Na and Cs are represented by 
dots that track the minima of their potentials (orange, Na; blue, Cs). Overlaid are graphics of 
the optical tweezers. Radial trap frequencies are labeled in the first and last panels (axial trap 
frequencies are roughly 6 times smaller). I to III depict the merging process. In IV, the 700-nm 
tweezer has been extinguished, and only the 976-nm tweezer remains. (B) Measured survival 
probability of Na and Cs after the sequence depicted in (A), followed by separating the 
tweezers through a reverse sequence to image the atoms. The two atoms are merged into 
the same tweezer at the survival maximum for Na. Error bars denote the Wilson score 
interval. The dashed lines represent the survival rates due to imperfect reimaging. 
Fig. 3. Collisions of Na and Cs. The hold time in the merged trap is varied so as to measure the 
evolution of trap occupancy owing to various collision [...] occupancies allows us to distinguish 
one- and two-body processes. The fastest time scales are indicated next to the thick fitted curves. 
The fits are explained in the supplementary materials. (Left) For both atoms in a mixture of 
hyperfine states, the loss is dominated by rapid two-body hyperfine-state-changing 
collision-induced loss. (Middle) For both atoms in their lowest hyperfine states, the loss is 
explained by two-body [...] gives background-gas-limited lifetime of ~5 s for both atoms. We 
post-select on empty final tweezers, and markers denote initial trap occupancy. 


Fig. 4. PA spectroscopy of NaCs*. (A) NaCs molecular potentials as a function of internuclear 
distance (27). PA light excites the ground-state atoms to vibrational levels of the NaCs* excited 
molecular potentials, from which they mostly decay to vibrationally excited electronic 
ground-state molecules (red wavy line). The long-range asymptotes of the excited state 
potentials (dominated by van der Waals interactions in the heteronuclear molecules) correspond 
to one of two cases: ground-state Na colliding with excited Cs in either the lower-energy 
6P1/2 (D1 line) or higher-energy 6P3/2 state (D2 line). (B) The probability of single Na (orange), 
single Cs (blue), and joint Na+Cs (red) atoms evolving to the “no atoms” detection channel as 
the PA light is detuned from the Cs D2 line dissociation threshold at 351,730 GHz. When 
both atoms are initially loaded into the tweezer (bottom), two-body loss resonances appear 
because of molecule formation. As a validation of our method, we checked that no loss 
resonances were observed when only one atom was present (top). The positions of the loss 
resonances are fitted with the LB dispersion model in Eq. 1 in order to identify three 
different potentials and fit the respective C₆ dispersion coefficients. The expected resonance 
positions based on these fits are marked by vertical lines, as indicated in the legend. Except 
for v' = -7, the root mean square deviations of the fitted dispersion curve from the measured 
frequencies are 0.3, 0.6, and 0.8 GHz for the c³Σ₁, c³Σ₀, and B¹Π₁ states, respectively. At 
v' = -7, a crossing of molecular energy levels causes the measured spectrum to deviate from 
the prediction according to Eq. 1. Unassigned lines in the spectrum are likely due to rotational 
and hyperfine structure and predissociating potentials. 
where μ is the reduced mass, and ħ is the reduced 
Planck’s constant. We extracted the C₆ dispersion 
coefficients that characterize the 1/r⁶ component 
of the potentials and v'₀, which is an offset between 
-1 and 0. 

Fitting to the positions of our observed 
c³Σ₁ resonances gives v'₀ = -0.79 and C₆ = 8.5(6) 
× 10³ au (atomic units), which is in agreement 
with the theoretical value C₆ = 7.96 × 10³ au (30). 
From the remaining loss resonances, we identified 
two additional progressions (B¹Π₁ and 
c³Σ₀) with C₆ = 1.42(33) × 10⁴ au and C₆ = 1.47(26) 
× 10⁴ au (Fig. 4B). Both values are near the theoretical 
value of C₆ = 1.83 × 10⁴ au (30). Our state 
labels correspond to the molecular wave functions 
in the near-threshold regime and differ from 
the labels in (30) because of an avoided crossing, 
as noted in (31). Here, the assignment of the 
c³Σ₁ progression is based on previous observation 
of the same resonances (28), whereas B¹Π₁ 
continues a previously observed sequence (31). 
The remaining progression corresponds to c³Σ₀ 
because this is the only other compatible state. 
We interpret the PA spectrum as clear evidence 
for molecule formation because the resonance 
peaks appear exclusively as simultaneous loss of 
Na and Cs, and the resonance frequencies agree 
with independent measurements. 
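
The near-threshold level structure implied by the LeRoy-Bernstein relation for a 1/r⁶ potential can be sketched numerically. The code below is an illustration under stated assumptions, not the fitting procedure used here: it uses the standard LB closed form for n = 6, takes C₆ = 8.5 × 10³ au and v'₀ = -0.79 as the fitted values (the exponent is read from damaged text and should be treated as an assumption), and uses a reduced mass computed from standard ²³Na and ¹³³Cs atomic masses. All names are hypothetical.

```python
import math

AMU_TO_ME = 1822.888486          # atomic mass unit in electron masses
HARTREE_TO_GHZ = 6.579683920e6   # Hartree energy expressed as a frequency in GHz

# Reduced mass of 23Na133Cs from standard atomic masses (assumed values, amu).
M_NA, M_CS = 22.98977, 132.90545
MU_NACS_AMU = M_NA * M_CS / (M_NA + M_CS)


def lb_binding_energy_ghz(v, v0, c6_au, mu_amu=MU_NACS_AMU):
    """LeRoy-Bernstein binding energy (GHz) for a -C6/r^6 long-range potential.

    v is counted downward from threshold (v = -1 is the highest bound state)
    and v0 is the non-integer offset between -1 and 0. Atomic units (hbar = 1)
    are used internally.
    """
    mu = mu_amu * AMU_TO_ME
    prefactor = 2.0 * math.sqrt(2.0 * math.pi / mu) * math.gamma(7 / 6) / math.gamma(2 / 3)
    x = prefactor * (v0 - v) / c6_au ** (1 / 6)
    return x ** 3 * HARTREE_TO_GHZ


for v in range(-5, -10, -1):
    e = lb_binding_energy_ghz(v, v0=-0.79, c6_au=8.5e3)
    print(f"v' = {v}: binding energy ~ {e:.0f} GHz")
```

With these inputs, levels from about v' = -5 to v' = -9 fall within the 30 to 250 GHz window scanned by the PA laser, and the cubic dependence on (v'₀ - v') makes the level spacing grow rapidly with binding energy.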

Our technique can in principle be extended 
beyond the simple bialkalis demonstrated here 
and to produce deeply bound molecules. Mole- 
cules in a single-quantum state could be created 
through coherent transfer (32, 33) of atoms pre- 
pared in the motional ground state (34-37). Di- 
polar molecules trapped in a configurable array 
of optical tweezers (38, 39) would constitute a 
new type of qubit for quantum information processing 
(40) and an important resource with which 
to explore quantum phases (41, 42). 
Flexo-photovoltaic effect 

Ming-Min Yang, Dong Jik Kim, Marin Alexe 

It is highly desirable to discover photovoltaic mechanisms that enable enhanced efficiency 
of solar cells. Here we report that the bulk photovoltaic effect, which is free from the 
thermodynamic Shockley-Queisser limit but usually manifested only in noncentrosymmetric 
(piezoelectric or ferroelectric) materials, can be realized in any semiconductor, including 
silicon, by mediation of flexoelectric effect. We used either an atomic force microscope or 
a micrometer-scale indentation system to introduce strain gradients, thus creating very large 
photovoltaic currents from centrosymmetric single crystals of strontium titanate, titanium 
dioxide, and silicon. This strain gradient-induced bulk photovoltaic effect, which we call the 
flexo-photovoltaic effect, functions in the absence of a p-n junction. This finding may extend 
present solar cell technologies by boosting the solar energy conversion efficiency from a wide 
pool of established semiconductors. 


Since its first observation in the 19th century, 
the photovoltaic (PV) effect has been studied 
intensively for scientific interest and as a 
sustainable energy source to replace fossil 
fuels and reduce carbon emissions (1-3). 
In 1954, the first high-power modern silicon 
solar cells—in which the photoexcited carriers 
were separated by a built-in electric field de- 
veloped at a p-n junction—were invented (4). 
By virtue of the appropriate bandgap energies 
of the semiconductors in a p-n junction solar 
cell, sunlight is efficiently absorbed, resulting 
in considerable power conversion efficiency. At 
present, silicon PV cells are the mainstay of the 
modern solar industry, contributing more than 
1% of the global electricity supply. Nevertheless, 
PV cells based on p-n junctions have a photo- 
voltage limited by the bandgap energy of the 
constituent semiconductors and an ultimate ef- 
ficiency constrained by the Shockley-Queisser 
(S-Q) limit (5). 

Ferroelectric materials exhibit a PV effect, 
called the bulk photovoltaic (BPV) effect (6, 7), 
that is distinct from that of p-n junctions. Under 
uniform illumination, a homogeneous ferroelec- 
tric material gives rise to a current under zero 
bias [short-circuit current (Isc)] that depends on 
the polarization state of the incident light and 
produces an anomalously large photovoltage 
well exceeding its bandgap energy. This peculiar 
PV effect originates from the asymmetric distrib- 
ution of photoexcited nonequilibrium carriers 
in momentum space, caused by the absence of 
centrosymmetry in the material (7). Owing to its 
distinctive charge separation mechanism, solar 
cells based on the BPV effect can, in principle, ex- 
ceed the S-Q limit (8). However, a substantial BPV 
effect is generally found in wide-bandgap non- 
centrosymmetric materials such as ferroelectric 
BaTiO₃ (8) and BiFeO₃ (9), leading to an overall 
extremely low device efficiency under solar illumi- 
nation. Hence, one way to enhance the efficiency 
may be to realize the BPV effect in semiconductors 
with more favorable bandgaps, regardless 
of their crystalline symmetry. 

Flexoelectricity is an electromechanical prop- 
erty that reflects a coupling between an electric 
polarization and a strain gradient (10, 11). In centrosymmetric 
materials, a strain gradient breaks 
the inversion symmetry, resulting in a polarization 
with a preferred direction and enabling a 
piezoelectric composite containing no piezoelectric 
elements (11, 12). Likewise, one can hypothesize 
that a strain gradient allows the manifestation of 
the BPV effect in materials that were originally 
centrosymmetric. In this study, we propose and 
demonstrate that the BPV effect can be induced 
in any semiconductor by mediation of the flex- 
oelectric effect. Given that flexoelectricity is a 
universal property of all materials, ranging from 
biomaterials (13) to semiconductors and dielectrics 
(14) to two-dimensional materials such 
as graphene (15), this strain gradient-induced 
BPV effect, termed here the flexo-photovoltaic 
(FPV) effect, is possible for all symmetry classes. 
Thus, devices designed on the basis of the FPV 
effect can be fabricated with silicon or any other 
semiconductors. 

To demonstrate our idea, we explored the 
PV effect induced by a point force exerted onto 
the surface of centrosymmetric materials, including 
a SrTiO₃ single crystal and a rutile TiO₂ 
single crystal [section S1 of (16)]. The point force 
was exerted by the tip of an atomic force mi- 
croscope (AFM), inducing local inhomogeneous 
strain at the tip-surface contact area and, therefore, 
a local breaking of centrosymmetry (17, 18). 
In our experiments, we used a custom-made 
photoelectric atomic force microscope (Ph-AFM) 
consisting of an AFM-based system equipped 
with a customized current amplifier-filter system 
and an optical system (19). The optical system 
allowed us to illuminate a sample surface with 
a 405-nm laser properly polarized by a half- 
wavelength plate. A conductive AFM tip applied 
a local force on a sample surface and simulta- 
neously collected the resultant PV current. A 
brief schematic of the Ph-AFM setup is shown 
in Fig. 1, A and B. 


SrTiO3 single crystals are ideal for studying
the flexoelectric effect, owing to their simple 
cubic centrosymmetric lattices and large dielectric 
permittivities (14, 20). Unlike its sister material
BaTiO3, ordinary crystals of SrTiO3 do not exhibit
the BPV effect because they have a center of in- 
version symmetry. However, with illumination 
around the contact area on a (001) face of a 
SrTiO3 crystal, we observed that I_SC exhibits a
large transient peak as the loading force is increased
from 1 to 18 μN (Fig. 1C). This peak is
reproducible, as confirmed by repeated exertion and
withdrawal of the loading force.

To show that the pronounced enhancement of
I_SC by the point force is not confined to SrTiO3
crystals or a cubic structure, we investigated
the force-induced PV current in a single crystal
of rutile TiO2, which is well known for its photoelectric
applications such as dye-sensitized PV
cells and photocatalysis. As in the case of SrTiO3,
large negative I_SC appears once a large force is
exerted on a TiO2 (100) face. Figure 1D shows a
stable current under the 15-μN force, confirming
that the force-induced PV effect is not a transient
effect in this material. Although it depends on
contact conditions and locations, the point force
exerted by the AFM tip gives rise to a substantial
current density (up to ~1 A/cm²) at the nanoscale
contact area. This current density is more than
three orders of magnitude higher than the I_SC
value (0.3 mA/cm²) obtained from a Schottky
junction between TiO2 and Pt under the same
illumination condition [section S6 of (16)].

The current density increases by more than a
factor of 100 when the loading force is increased
from 1 to 15 μN (Fig. 1E). This phenomenon
cannot be explained by expansion of the contact
area with the loading force, because the contact
area increases by, at most, a factor of ~6 with a
15-fold increase in loading force in a simple
elastic sphere contact model [section S2 of (16)].
Moreover, a 100-fold increase in contact area as
a result of force applied by an AFM tip is not
realistic. Notably, I_SC has a negative value on a
(100) face of the TiO2 crystal but becomes positive
when the conductive AFM tip is loaded on a
(001) face (Fig. 1F). The fact that the direction of
the PV current depends on the crystallographic
orientation of the TiO2 crystal indicates that the
observed PV effect cannot merely be attributed
to a probable Schottky contact formed by the
TiO2 surface and the Pt coating of the AFM tip.
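
The contact-area argument in this paragraph can be checked with a short calculation. The sketch below assumes ideal Hertzian contact of an elastic sphere on a flat surface, for which the contact radius scales as the cube root of the force, so the contact area scales as the force to the power 2/3. The function name is illustrative and is not taken from the source.

```python
def hertz_contact_area_ratio(force_ratio: float) -> float:
    """Ratio of Hertzian contact areas for a given ratio of loading forces.

    Assumption: elastic sphere on a flat surface, so the contact radius
    a ~ F**(1/3) and the contact area (pi * a**2) ~ F**(2/3).
    """
    if force_ratio <= 0:
        raise ValueError("force_ratio must be positive")
    return force_ratio ** (2.0 / 3.0)


# Loading force raised from 1 to 15 uN, i.e. a 15-fold increase.
area_ratio = hertz_contact_area_ratio(15.0)
print(f"Contact area grows by a factor of about {area_ratio:.1f}")
```

Under this assumption a 15-fold increase in force enlarges the contact area by roughly a factor of 6, far short of the more than 100-fold increase reported for the current density.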

The BPV effect is a potential origin of this PV 
current enhancement. As hypothesized, it is likely 
that a point force exerted on a crystal surface 
generates a local strain gradient, resulting in 
local centrosymmetry breaking and, thus, a local 
BPV current under illumination—i.e., the FPV 
effect—in the absence of a p-n junction and an 
appropriate band alignment. Given that both the 
flexoelectric response and the BPV effect depend 
on the crystallographic orientation (19-21), our 
results suggest that the strain gradient and the 
resultant FPV effect have a predominant role in 
the enhanced local PV current. 

The strain gradient induced by a sphere contact
has a complex spatial distribution with very
large strain gradient values in an elastic material,
as described in section S2 of (16). An AFM
tip apex can be approximated as a hemisphere,
and the distribution of the strain gradient induced
by the AFM tip can be calculated analytically
with the Hertzian model and Boussinesq's
calculation (22). Figure 2 shows the spatial distributions
of the zz component of the calculated
strain and its derivative with respect to z under
~15.7 μN of force with a 10-nm-radius contact
area at the origin. The strain gradient is as large
as 10⁷ m⁻¹. We found that the I_SC density and
the volume beneath the AFM tip that is subject
to a strain gradient larger than 1 × 10⁶ m⁻¹
show a similar dependence on the exerted force
[(Fig. 1E); the detailed process to obtain the
relative volume can be found in fig. S9]. It is
expected that the very large strain gradients
induced by the AFM tip will break local symmetry,
thus leading to the manifestation of the
BPV effect locally under illumination. However,
deep theoretical considerations are required to
understand the intricate relationship between
the complex distribution of the strain gradient
and the BPV properties.

The main characteristic of the BPV effect is a 
periodic dependence on the angle between the 
PV current and the light polarization, stemming 
from its tensorial nature (27). For the present case, 
this dependence is predicted to be 


$$I_{\mathrm{BPV}} = I_0\,(A_1 + B_1 \cos 2\alpha) \qquad (1)$$


where I_BPV is the PV current, I_0 is the light intensity,
A_1 and B_1 are effective BPV coefficients of
the locally deformed crystal, and α is the polarization
angle of the incident light with respect to
the top surface edge, as described in section S3 of
(16). The FPV effect should inherit the distinctive
feature of the BPV effect and exhibit a sinusoidal
dependence on the incident light polarization
angle with a period of 180°. Indeed, I_SC measured
by a conductive AFM tip in the configuration
illustrated in Fig. 1B on the SrTiO3 (010)
and TiO2 (001) surfaces exhibits a light polarization
dependence in accord with Eq. 1, as seen in
Fig. 3. The sinusoidal behavior upon rotating the 
light polarization provides strong evidence that 
the underlying mechanism of the force-induced 
PV effect is the BPV effect generated by local sym- 
metry breaking due to inhomogeneous strain— 
namely, the FPV effect. 
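
To make the polarization test concrete, the sketch below fits measured currents to a constant term plus a term proportional to cos 2α, which is the 180°-periodic form described for Eq. 1. It is a minimal illustration, not the authors' fitting procedure: the function name and the synthetic data are assumptions, the light intensity is lumped into the two fitted coefficients, and the angle origin is assumed to be aligned so that no phase offset is needed.

```python
import math


def fit_cos2alpha(angles_deg, currents):
    """Least-squares fit of currents to a + b * cos(2 * alpha).

    Returns (a, b). In the notation of Eq. 1, a and b correspond to the
    light intensity multiplied by the two effective BPV coefficients.
    """
    if len(angles_deg) != len(currents) or len(angles_deg) < 2:
        raise ValueError("need at least two paired samples")
    x = [math.cos(2.0 * math.radians(angle)) for angle in angles_deg]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(currents) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0.0:
        raise ValueError("angles do not span different cos(2*alpha) values")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, currents))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# Synthetic example (hypothetical values): offset -2.0, modulation 0.5.
angles = list(range(0, 360, 15))
currents = [-2.0 + 0.5 * math.cos(2.0 * math.radians(t)) for t in angles]
offset, modulation = fit_cos2alpha(angles, currents)
print(offset, modulation)
```

A nonzero fitted modulation with a 180° period is the signature expected of a BPV-type response; a setup with a different angle origin would additionally need a phase term.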

The FPV effect should be neither confined to 
ionic crystals nor restricted to nanoscale geo- 
metries. First, we performed the same experi- 
ment (as depicted in Fig. 1) on an HF-passivated 
surface of a commercial p-type Si (001) crystal. As 
in our initial experiment, I_SC increased by two
orders of magnitude from ~5 pA with the 1-μN
loading force to ~0.5 nA with the 15-μN loading
force [Fig. 4A and section S7 of (16)]. Second, 
we demonstrated the FPV effect by using a home- 
built indentation system that deforms a semi- 
conductor by means of a conductive tungsten 
probe needle with a radius of ~10 μm (fig. S1).
Figure 4B shows crystallographic orientation- 

Fig. 1. Force-induced PV effect in centrosymmetric SrTiO3 and TiO2 single crystals.
(A) Setup for illumination around the contact area. The tip loading force is controlled by the
feedback loop of an AFM. (B) Setup for illumination on the side surface. This illumination
geometry was chosen to avoid the effect of Fresnel reflection and to ensure that light
absorption would be independent of the light polarization. (C and D) Evolution of the
photocurrent induced and collected by a conductive AFM tip with a high loading force on
(C) a SrTiO3 (001) face and (D) a TiO2 (100) face. (E) Loading force dependence of the
induced photocurrent density and the relative volume subject to a strain gradient higher than
1 × 10⁶ m⁻¹ (fig. S9). a.u., arbitrary units. (F) Positive photocurrent measured on a TiO2 (001)
face with a 15-μN force applied by the AFM tip.
Fig. 2. Spatial distributions of strain and strain gradient induced by an ideal spherical
indenter. (A) The z component of strain (ε) and (B) its partial derivative with respect to z are
shown. The contact area with a radius (r) of 10 nm is centered at the origin, and the force is 
pointing upward. The positive (or negative) value of the strain means a tensile (or compressive) 
strain along the z axis. 
Fig. 3. Light polarization dependence of the
force-induced photocurrent. (A and B) Sinus- 
oidal dependence of the photocurrent measured 
on (A) a SrTiO3 (010) face and (B) a TiO2 (001) 
face under 405-nm laser illumination. The red 
lines are fits to Eq. 1. Error bars indicate the SD 
of the PV current measured 25 times. 


dependent photocurrent-voltage (I-V) characteristics
of the SrTiO3 crystal acquired under 4 N
of mechanical force exerted by the probe needle
and 405-nm laser illumination directly to the
probe contact area, as in Fig. 1A. These linear I-V
characteristics are similar to those of ferroelec- 
trics under illumination (23). The oscillating 
PV current with the rotating light polarization, 
which is well fitted to Eq. 1 (Fig. 4C), demon- 
strates the FPV effect under the indentation 
force on the micrometer scale. The crystallo- 
graphic orientation-dependent PV current is also 
observed when the TiO2 crystal is deformed by
the indentation system (fig. S2). The persistence 
of the FPV effect from the nanoscale (AFM) to 
the micrometer level is promising for future 
device design and potential applications. The 
FPV effect is not related to a plasmonic effect 
found in the tip-enhanced Raman scattering: 
Only an atomically sharp tip coated with Ag or 
Au shows the plasmonic enhancement in the 
visible range (24), whereas we used platinum 
or tungsten probes ranging from the nanoscale 
to the micrometer scale. Likewise, a potential 
cubic-to-tetragonal phase transition induced in a 
SrTiO3 single crystal under a large hydrostatic
pressure (>6 GPa) should not play a large role 
because of the centrosymmetric nature of the 
induced tetragonal phase (25). 

We emphasize here four main features of the 
FPV effect. First, the separation of the photo- 
excited carriers in the FPV effect is controlled 
by the local symmetry and the resultant local 
BPV effect, in which the power conversion ef- 
ficiency can, in principle, exceed the S-Q limit 
(8). Second, to obtain a high photocurrent from 
any semiconductor, only a strain gradient gene- 
rator, such as a sharp probe with a sufficient 

Fig. 4. FPV effect extended to covalent crystals and to the micrometer scale.
(A) Force-induced photocurrent on a (001) Si crystal, as measured by Ph-AFM under
illumination of the top indented surface. (B) Current-voltage characteristics measured
on (001) and (111) faces of a SrTiO3 crystal by the microindenter applying a 4-N force under
illumination of the top indented surface. (C) Light polarization dependence of the
photocurrent on a SrTiO3 (010) face measured by the microindenter under illumination on
the side surface. The red line is the fit of experimental data to Eq. 1. Because the
polarization angle origin of the microindenter is not coincident with the Ph-AFM setup, the
oscillating I_SC has a phase shift compared with that depicted in Fig. 3B. Error bars indicate
the SD of the PV current measured 50 times.

loading force, is necessary. This should be distinguished
from the previous reports that a
strain gradient modifies a bandgap, but the 
charge separation in the photoelectric process 
still requires a proper band alignment (26) or a 
nanostructure (27, 28). Third, whereas the BPV 
effect is possible only in noncentrosymmetric 
materials, the FPV effect is universal. It is al- 
lowed by symmetry in all materials, owing to 
the universal nature of the strain gradient- 
induced centrosymmetry breaking. The FPV 
effect can be realized in ionic crystals (SrTiO3
and TiO2) and covalent crystals (Si) but is also


relevant to any semiconductors, ranging from 
organic-inorganic hybrid perovskites to semi- 
conducting polymers and even topological in- 
sulators. For instance, the topological insulator 
Bi2Te3 with a centrosymmetric structure exhibits
the BPV effect by means of the flexoelectric
effect (29). Finally, given that the BPV effect 
consists of asymmetric quantum mechanical 
processes such as photoexcitation, relaxation, 
recombination, and scattering, we demonstrate 
that one can readily control the quantum mech- 
anical processes by macroscopic tools such as an 
AFM tip and a probe needle. 

The configuration of our PV indentation system
is very simple, and the FPV effect can become
increasingly substantial as material dimensions
decrease to the nanoscale, where flexoelectricity
is more important (17). Thus, a valuable strain
engineering route for improving the performance
of solar cells and optoelectronic devices is now
open. For example, a tandem solar cell can be
easily fabricated by combining an array of in- 
denters and a conventional p-n junction, enabl- 
ing a higher efficiency because the FPV effect 
can be designed to boost the existing PV current 
generated by the buried p-n junction. Given that 
the lattice mismatch at the interfaces and crys- 
tallographic disorders in epitaxial and polycrys- 
talline thin film solar cells produce substantial 
strain gradients (30-32), the associated FPV 
effect would considerably enhance the perform- 
ance of these solar cells; however, this topic 
remains largely unexplored. In addition to in- 
organic solar cells, the FPV effect is also likely 
to play an important role in flexible and stretch- 
able electronics based on organic and polymeric 
semiconductors. Both the bending of flexible 
organic devices at the macroscopic level (33) and 
the folding and entanglement of the polymeric 
chains at the nanoscale would generate sizable 
strain gradients (34), which redistribute the 
electron cloud of π molecular orbitals, modifying
electronic transport and inducing the FPV effect 
under illumination (15, 35). But the details of 
how strain gradient and FPV effects influence 
nanoscale electronic properties remain a matter 
of debate. 

High parasite diversity accelerates
host adaptation and diversification 


A. Betts, C. Gray, M. Zelek, R. C. MacLean, K. C. King


Host-parasite species pairs are known to coevolve, but how multiple parasites 
coevolve with their host is unclear. By using experimental coevolution of a host bacterium 
and its viral parasites, we revealed that diverse parasite communities accelerated 

host evolution and altered coevolutionary dynamics to enhance host resistance and 
decrease parasite infectivity. Increases in parasite diversity drove shifts in the mode of 
selection from fluctuating (Red Queen) dynamics to predominately directional (arms 
race) dynamics. Arms race dynamics were characterized by selective sweeps of 
generalist resistance mutations in the genes for the host bacterium’s cell surface 
lipopolysaccharide (a bacteriophage receptor), which caused faster molecular evolution 
within host populations and greater genetic divergence among populations. These 
results indicate that exposure to multiple parasites influences the rate and type of 


host-parasite coevolution. 


Coevolution between hosts and parasites
is hypothesized to be central to numerous
biological phenomena, from rapid evolutionary
change (1) to the maintenance of
diversity (2) and sexual reproduction (3, 4).
Host-parasite coevolution is traditionally inves- 
tigated in a pairwise framework where a single 
parasite species infects one host species (5). How- 
ever, in nature, these pairwise relationships rare- 
ly exist in isolation (6, 7); hosts are often under 
attack by multiple parasite species (8), and para- 
sites must compete for hosts (9-12). Host-parasite 
coevolution thus operates within a network of 
species interactions (13). It nevertheless remains 
unknown whether and how diverse parasite 
communities can shape coevolutionary patterns 
and processes. 

Coevolution with highly diverse parasite com- 
munities should impose stronger selection for 
host resistance if this diversity yields more in- 
fections and higher mortality than single para- 
sites. This outcome could ultimately drive faster 
host evolution and increased divergence among 
host populations. To test these predictions, we 
applied an experimental coevolution approach to 
an in vitro bacterium-phage system. We exposed 
the host bacterium, Pseudomonas aeruginosa, 
to communities of one to five lytic viral parasites 
(bacteriophages PEV2, LUZ19, LUZ7, 14-1, and 
LMA2) that are obligate lethal parasites. These 
parasites each infect their hosts by attaching 
to one of three specific cell surface receptors 
[lipopolysaccharides (LPSs), Ton-B-dependent 
receptors, or type IV pili], and hosts typically 
evolve resistance by modifying or deleting these 
attachment sites (14). Parasites, in turn, can 
evolve reciprocal adaptations to circumvent host 
resistance (15). We experimentally coevolved 
host populations independently in cell culture
tubes with replicates of a five-parasite community
(high diversity, n = 30 cultures), with every
possible pairwise parasite combination (medium
diversity, n = 30), and in a single-parasite
treatment with replicates of each parasite in
monoculture (low diversity, n = 30), alongside
a parasite-free control (n = 30). We tested for
effects on the tempo and mode of host-parasite 
coevolution, conducting phenotypic assays of 
host resistance and parasite infectivity, as well 
as using deep sequencing to measure changes 
in the genomic compositions of host and parasite 
populations through time. Because we were able 
to experimentally manipulate parasite diversity, 
our work differs from the approaches used to 
investigate coevolution in natural communities, 
where diversity is not controlled exogenously 
(16, 17). 

As with predators and prey, parasite densities 
are expected to track host densities, producing 
time-lagged cycles in population sizes over time 
(18). In our experiment, given that the host was 
initially susceptible, we might have expected large 
amplitude oscillations in population size, unless 
host resistance rapidly evolved (19). By contrast, 
we observed that host densities increased through- 
out the experiment while parasite densities re- 
mained stable (Fig. 1A and supplementary text), 
indicating that underlying coevolutionary dy- 
namics interfered with the expected ecological 
dynamics (20). 

We determined whether coevolution took place 
in our experiment by performing 13,500 time- 
shift assays. This approach assesses the infectivity 
of parasites toward their past, present, and fu- 
ture hosts and, reciprocally, the resistance of 
hosts to their past, present, and future parasites 
(21). Resistance was measured by inhibition 
assays in which individual host genotypes were 
challenged with parasite communities and re- 
sultant growth or inhibition was analyzed to 
compare the mean levels of resistance among 
treatments. Infectivity and resistance varied wide- 


ly across individual host-parasite communities 
over time (fig. S1). Despite this complexity, two 
clear and significant patterns (F(2,13044) = 1766.21,
P < 0.0001) emerged. First, parasites from the 
future were more infective than parasites from 
the present or past (Fig. 1B). Likewise, hosts evolved 
increased resistance over time (Fig. 1C). This 
pattern is typical of arms race dynamics, com- 
monly seen with in vitro bacteria-phage co- 
evolution (5, 22). Second, bacterial resistance 
increased with parasite diversity and parasite 
infectivity decreased (F(2,85) = 9.7, P < 0.001),
suggesting that parasite communities impose 
stronger selection on their host than single 
parasites alone. 

To test the hypothesis that parasite diversity 
accelerated coevolution at a molecular level, we 
sequenced entire host and parasite populations 
longitudinally at multiple time points (ances- 
tor, midpoint, and end point) for the high- (n = 
12 cultures), medium- (n = 10), and low-diversity
(n = 15) and control (n = 10) treatments. We could 
thus detect any mutations that arose during co- 
evolution and estimate their frequency changes 
within populations. In the hosts, we found 474
nonsynonymous and 75 synonymous polymor- 
phisms across 173 genes and 133 intergenic mu- 
tations. Parallel evolution with mutations in the 
same genes or genes in the same pathways was 
common across all treatments: >50% of muta- 
tions occurred in <5% of these genes. Notably, 
we observed parallel evolution in genes that are 
known phage receptors (table S1), including 
genes for LPS (n = 190 mutations), type IV pili 
(n = 69 mutations), and the Ton-B-dependent 
receptor (n = 55 mutations). To measure the rate 
of bacterial molecular evolution, we calculated 
the Euclidean genetic distance of each popula- 
tion from the ancestor (Fig. 2A). We found that 
higher levels of parasite diversity drove greater 
host divergence from the ancestor and conse- 
quently faster host evolution (fig. S2) [analysis of 
variance (ANOVA), F(2,34) = 10.5, P < 0.001; Tukey
post hoc test, P < 0.05]. 
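
As an illustration of the distance measure used here, the sketch below computes the Euclidean distance between an ancestral and an evolved allele-frequency profile. It is a minimal example under stated assumptions: each population is represented as a mapping from a mutation label to its frequency, mutations absent from a population are given frequency 0, and the labels and values are hypothetical rather than data from the study.

```python
import math


def euclidean_genetic_distance(ancestor, evolved):
    """Euclidean distance between two allele-frequency profiles.

    Each profile maps a mutation label to a frequency between 0 and 1.
    Mutations missing from a profile are treated as frequency 0.
    """
    labels = set(ancestor) | set(evolved)
    squared = sum(
        (evolved.get(label, 0.0) - ancestor.get(label, 0.0)) ** 2
        for label in labels
    )
    return math.sqrt(squared)


# Hypothetical example: the ancestor carries none of the mutations.
ancestor_profile = {}
evolved_profile = {"mutation_a": 0.6, "mutation_b": 0.8}
print(euclidean_genetic_distance(ancestor_profile, evolved_profile))
```

A population that has fixed more mutations, or carries them at higher frequency, sits farther from the ancestor, which is why this distance can serve as a proxy for the amount of molecular evolution over a fixed time span.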

Two mechanisms could explain why diverse 
parasite communities accelerate the rate of host 
evolution. On one hand, diverse communities 
are more likely, by chance alone, to contain para- 
sites that drive rapid evolution. On the other 
hand, it is possible that parasite diversity per se 
accelerates host evolution by increasing selec- 
tion for resistance. To discriminate between these 
mechanisms, we calculated the contribution of 
parasite diversity to the speed of host evolution 
by adopting an approach developed to deter- 
mine the contribution of biodiversity to ecosys- 
tem function, represented in this study by the 
host evolutionary rate (23). Using this approach, 
we found that the combination of all five para- 
sites contributed to faster host evolution overall 
than phage monocultures (test for transgressive
overyielding D_max > 0; one-sample t tests
with Bonferroni adjustment, P < 0.05). Thus, 
the diversity of interactions among parasites 
within communities, and not merely the pres- 
ence of a particular parasite, imposed stronger 
selection. 
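
The logic of the overyielding test can be sketched as follows. The code assumes one common definition of transgressive overyielding, in which the response of the mixture is compared with that of the best-performing single-component treatment; the exact procedure used in the study follows ref. 23 and may differ in detail. The function name and the numbers are hypothetical.

```python
def transgressive_overyielding(mixture_value, monoculture_values):
    """Relative excess of the mixture over the best monoculture.

    Assumed definition: D_max = (mixture - best) / best, where best is
    the largest monoculture value. D_max > 0 means the mixture exceeds
    every single-component treatment.
    """
    if not monoculture_values:
        raise ValueError("at least one monoculture value is required")
    best = max(monoculture_values)
    if best <= 0:
        raise ValueError("best monoculture value must be positive")
    return (mixture_value - best) / best


# Hypothetical host evolutionary rates (arbitrary units).
single_parasite_rates = [0.5, 1.0, 0.75]
five_parasite_rate = 1.5
print(transgressive_overyielding(five_parasite_rate, single_parasite_rates))
```

A positive value indicates that the community drives faster host evolution than even its most potent member alone, which is what separates an effect of diversity per se from a sampling effect.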

Whole-genome sequencing also revealed pro-
found changes in parasite communities. At an 
ecological level, the composition of communities 
changed markedly over time (fig. S3). Parasite 
diversity decreased in both the medium- and 
high-diversity treatments, but diversity remained 
higher in the high-diversity treatment than in the 
medium-diversity treatment (fig. S4). At a ge- 
netic level, we found evidence for rapid molecular 
evolution of phage, which supports the results 
of our phenotypic assays. In total, we detected 
533 single-nucleotide polymorphisms (SNPs) 
across all five parasite types that reached a fre- 
quency of >10% in the parasite populations 
(table S2). The proportions of nonsynonymous 
SNPs were very high, exceeding 0.7 for each 
parasite (table S3), and parallel evolution was
common, implying that these SNPs were pre- 
dominantly beneficial mutations. To investigate 
the role of parasite diversity in phage evolu- 
tion, we focused our analysis on communities 
that included the parasite PEV2 because this 

Fig. 1. Ecological and coevolutionary dynamics of host-parasite interactions. (A) We
tracked the population dynamics of hosts (solid lines) and parasites (dashed lines) for high-,
medium-, and low-diversity populations alongside the parasite-free control. All plotted
points show the mean population density ± SE. Bacterial populations challenged with
phage rapidly recovered their density. We did not observe cyclical oscillations in host and
parasite densities. CFU, colony-forming units; PFU, plaque-forming units. (B and C) To
directly test for coevolution, we used 13,500 time-shift assays to measure changes in phage
infectivity and bacterial resistance to phage. Lower levels of parasite infectivity (B)
and higher levels of host resistance (C) evolved with increasing parasite diversity
(F(2,85) = 9.7, P < 0.001). Furthermore, hosts were more susceptible to parasites from
the future than to contemporary parasites, and they were most resistant to parasites
from the past. Likewise, future parasites were more infective than past or contemporary
parasites (F(2,13044) = 1766.21, P < 0.0001). Error bars show 1 SE.

phage was well-represented at all levels of parasite
diversity and because, compared with the
genomes of the other phages used in this study, 
the PEV2 genome is well annotated. For exam- 
ple, all but two of the genes in the parasite 14-1 
genome code for hypothetical proteins. We de- 
tected 225 mutations in the PEV2 genome, and 
the rate of PEV2 evolution was independent 
of parasite diversity (fig. S5). Thirty-nine percent 
of the PEV2 mutations (n = 88) occurred in a 
single tail fiber gene, gp52. The tail fibers are 
the means by which the parasite recognizes and 
binds to its receptor and are known to undergo 
coevolution (24). We detected mutations in gp52 
and at least one host LPS mutation (usually in 
wzy or migA) in every community containing
PEV2 across all levels of parasite diversity (figs. 
S6 and S7). Collectively, these results provide 
genetic confirmation for reciprocal coevolution 
between host and parasite across all levels of 
parasite diversity, and they provide further evi- 
dence that parasite attachment to LPS is a key 
mechanism underpinning coevolution between 
PEV2 and P. aeruginosa. 


Coevolutionary dynamics are usually de- 
scribed as being one of two types. Arms races 
occur when selection results in directional in- 
creases in host resistance and parasite infec- 
tivity (25, 26), and with Red Queen dynamics, 
negative frequency-dependent selection causes 
fluctuations in allele frequencies (27, 28). It has 
been previously shown that each parasite follows 
a different coevolutionary trajectory when in- 
teracting with P. aeruginosa in a pairwise fash- 
ion (fig. S8) (29). Using a population genomic 
approach, we therefore sought to examine 
whether parasite diversity altered the mode 
of coevolution. 

We examined the type of coevolution occurring 
in each community by comparing the changes 
in frequency of each host allele between the 
mid- and end points of the experiment (29, 30). 
These time frames were previously shown to 
be suitable for observing coevolutionary dy- 
namics in P. aeruginosa-phage interactions (29). 
We focused our analysis on the host because 
changes in the frequency of phage alleles in 

Fig. 2. Parasite diversity accelerates host evolution. (A) Host allele frequencies after
10 days of coevolution with the high-, medium-, and low-parasite-diversity treatments
were used to calculate pairwise Euclidean distances between the ancestral sequence
(ANC) and each coevolved population. Increasing parasite diversity accelerated
the rate of host evolution. (B) The genetic distances between coevolved populations
were ordinated by nonmetric multidimensional scaling (MDS1 and MDS2). The ellipses represent a 95% confidence bubble around the means
for the different treatments. We found evidence of divergence between populations within treatments (ANOSIM, R = 0.11, P < 0.01), and the greatest
within-treatment diversification was observed in the high-parasite-diversity treatment.


Fig. 3. Red Queen coevolution is more common in pairwise host-parasite interactions. We
tested for Red Queen dynamics by regressing the change in host allele frequency (%) from
day 5 to day 10 (y axis) with the observed frequency (%) on day 5 (x axis). We found evidence for
negative frequency-dependent selection on host alleles under low and medium parasite diversity,
but not at high parasite diversity. The string of points forming a straight downward
slope from zero in all three panels represents alleles observed on day 5 that had
subsequently decreased in frequency to below our ability to detect them at day 10.


diverse communities might additionally be driven 
by interspecific competition among parasites 
(14). On average, host alleles with higher fre- 
quencies tended to decline in frequency over 
time in the low- and medium-parasite-diversity 
treatments, consistent with negative frequency- 
dependent selection on host resistance genes 
(Fig. 3). This mode of selection became less pre- 
dominant at higher levels of parasite diversity 
[analysis of covariance (ANCOVA), F(2,335) = 12.64,
P < 0.0001], suggesting that the occurrence of 
Red Queen dynamics was reduced in these 
communities. 
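
The test for negative frequency-dependent selection described here amounts to regressing the change in allele frequency on the starting frequency. The sketch below shows that regression with ordinary least squares; it is a simplified illustration that ignores the treatment covariate of the ANCOVA, and the function name and the frequencies are hypothetical.

```python
def frequency_change_regression(midpoint_freqs, endpoint_freqs):
    """Regress (endpoint - midpoint) frequency on midpoint frequency.

    Returns (slope, intercept). A negative slope means that alleles that
    were common at the midpoint tended to decline afterwards, as expected
    under negative frequency-dependent selection.
    """
    if len(midpoint_freqs) != len(endpoint_freqs) or len(midpoint_freqs) < 2:
        raise ValueError("need at least two paired frequencies")
    changes = [end - mid for mid, end in zip(midpoint_freqs, endpoint_freqs)]
    n = len(midpoint_freqs)
    mean_x = sum(midpoint_freqs) / n
    mean_y = sum(changes) / n
    sxx = sum((x - mean_x) ** 2 for x in midpoint_freqs)
    if sxx == 0:
        raise ValueError("midpoint frequencies must not all be equal")
    sxy = sum(
        (x - mean_x) * (y - mean_y) for x, y in zip(midpoint_freqs, changes)
    )
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical allele frequencies (%) at the midpoint and end point.
midpoint = [10.0, 20.0, 40.0, 80.0]
endpoint = [12.0, 18.0, 30.0, 50.0]
slope, intercept = frequency_change_regression(midpoint, endpoint)
print(slope, intercept)
```

Alleles that fall below the detection limit by the end point contribute a change equal to minus their starting frequency, which produces a line of slope -1 through the origin in such a plot.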

In contrast to Red Queen dynamics, arms 
races should result in recurrent selective sweeps 
of resistance mutations (5). The effects of di- 
rectional selection in host-parasite interactions 
have been studied in a variety of systems and
can be important for coevolution of human 
pathogens with human populations (31-34). 
To test for selective sweeps in host populations,
we calculated F_ST, which measures the
genetic divergence per mutation between each
coevolved host population and a sequenced ancestor
(35), and we set a conservative threshold
F_ST above which any mutation was deemed
to be under strong directional selection (Fig.
4A). Strong directional selection was more common
as parasite diversity increased (χ² = 20,
df = 3, P < 0.001), such that the number of 
fixed alleles in the high-diversity treatment 
was more than two times that observed un- 
der medium diversity (Fig. 4B). Of the 29 al- 
leles approaching fixation, 23 were in genes 


relating to LPS biosynthesis. LPS is very abun- 
dant on the bacterial cell surface, and phage 
may initially bind to LPS to anchor themselves 
to the cell before forming an irreversible at- 
tachment to a final receptor (36). Mutations that 
alter LPS biosynthesis may provide a general 
phage resistance mechanism. We have previous- 
ly shown that mutations in the LPS biosyn- 
thesis gene wzy provide resistance to four of 
the phages used in this study (14). Red Queen
and arms race dynamics fall at two ends of a 
continuum (27): Tight interaction specificity 
is required for the former, and for the latter, 
general host resistance mechanisms should 
evolve to allow future hosts to resist all pre- 
vious parasite genotypes. Our data point to 
the availability of generalist LPS biosynthesis 

Fig. 4. Parasite diversity leads to more arms race coevolutionary dynamics. (A) To test
for arms race dynamics, we calculated F_ST for all host SNPs across the P. aeruginosa
genome for the high-, medium-, and low-parasite-diversity treatments and the parasite-free
control. Each plotted point in the panel represents a single SNP, and host
genes that are known parasite targets, including genes involved in LPS biosynthesis, type IV pilus biosynthesis, and the Ton-B-dependent receptor,
are shown in italics. (B) We used a conservative F_ST cutoff to identify selective sweeps of SNPs per host population (± SE), and we found that increasing
parasite diversity increased the rate of fixation of S



resistance mutations as being key to the tran- 
sition from Red Queen to arms race dynamics 
as parasite diversity increases. That both dy- 
namics can arise from host-parasite commu- 
nity coevolutionary interactions is consistent with 
studies of bacteria-phage systems conducted in 
natural communities (16, 17). 

Parasites have been shown to play a key role 
in evolutionary diversification within (37-39) 
and among (40) host populations. Specifically, 
parasite-mediated selection against common 
host genotypes is predicted to stably maintain 
genetic diversity (31, 41). Host population ge- 
netic diversity should be greatest during pair- 
wise coevolution, where we observed that Red 
Queen dynamics are relatively stronger. Sur- 
prisingly, we found that levels of genetic diver- 
sity within host populations were not affected 
by parasite diversity (fig. S9) despite the differ- 
ent modes of selection acting across treatments. 
Parasite diversity, conversely, had a profound 
effect on allopatric diversification among host 
populations [analysis of similarity (ANOSIM), 
R = 0.11, P < 0.01]. The greatest divergence 
was found in the high-parasite-diversity treat- 
ment (Fig. 2B) (ANOVA, Fo213 = 122.15, P < 0.001; 
Tukey post hoc test, P < 0.05). The simplest ex- 
planation for this result is that rapid evolution 
within host populations leads to accelerated di- 
vergence among populations. 

Our study reveals that parasite communi- 
ties can form hotbeds of rapid antagonistic 
coevolution. Diverse parasite communities im- 
posed stronger selection on host populations 
than single parasites, causing faster selective 
sweeps of generalist resistance mutations and 
higher levels of host resistance. The multispe- 
cies coevolutionary process ultimately accel- 
erated the rate of molecular evolution within 
host populations and increased the genomic 
divergence among populations. Host resistance to
parasite infection can be specialized
(26). However, broad-spectrum defense strat- 
egies that confer protection against different 
enemies are widespread in animals (42-44) 
and plants (45, 46), as well as bacteria (47-49). 
We should thus consider looking beyond the 
pairwise framework toward parasite commu- 
nities as drivers of coevolution and rapid host 
evolution in nature. This focus could be par- 
ticularly informative in biodiversity hotspots 
with elevated evolutionary and speciation rates 
(27), where a multitude of parasites might be 
the fuel. 
MIGRATORY BEHAVIOR

From local collective behavior to global migratory patterns in white storks

Andrea Flack, Máté Nagy, Wolfgang Fiedler, Iain D. Couzin, Martin Wikelski


Soaring migrant birds exploit columns of rising air (thermals) to cover large distances
with minimal energy. Using social information while locating thermals may benefit
such birds, but examining collective movements in wild migrants has been a major
challenge for researchers. We investigated the group movements of a flock of 27 naturally
migrating juvenile white storks by using high-resolution GPS and accelerometers.
Analyzing individual and group movements on multiple scales revealed that a small
number of leaders navigated to and explored thermals, whereas followers benefited from
their movements. Despite this benefit, followers often left thermals earlier and at lower
height, and consequently they had to flap considerably more. Followers also migrated
less far annually than did leaders. We provide insights into the interactions between
freely flying social migrants and the costs and benefits of collective movement in
natural populations.


In many animal taxa, migrations are performed by large social groups (1), providing
various benefits to individual group members (2-6). In heterogeneous groups, coordinated
movements frequently generate leader-follower patterns (7-10), and individuals may adopt
these behavioral strategies, presumably without knowing their own or their group members’
roles (11). Given interindividual variation within migratory groups, it is necessary to
quantify the relationships between each individual to fully understand group dynamics,
social influences, and the resulting overall migration patterns. Despite existing
knowledge from theoretical work (12-14), studying collective movement in wild migratory
species has been a major challenge (15) because it is nearly impossible to record the
simultaneous movements of freely flying animals in large, natural groups with appropriate
spatiotemporal resolution.

We approached this question by tagging 
large numbers of juvenile white storks (Ciconia 
ciconia) with high-resolution tracking devices 
(Fig. 1, A to E). We recorded the trajectories of 
27 GPS-tagged naturally migrating juvenile 
storks flying in a flock (together with untagged 
birds; table S1) over ~1000 km during the first 
5 days of their migratory journey. Owing to 
some birds eventually leaving the flock, the 
number of tagged birds in the flock was 27, 22,
21, 20, and 17 for the first 5 days, respectively.
Using solar GSM (Global System for Mobile 
Communications)-GPS-accelerometer loggers, 
we recorded triaxial acceleration (at 10.54 Hz 
for 3.8 s every 10 min) and high-frequency GPS 
locations (at 1 Hz for 2 or 5 min every 15 min, 
synchronized in time between individuals; hence- 
forth, GPS bursts) of each individual during the 
group flights (Fig. 1, F to I). After these 5 days, 
we continued to monitor each bird’s movements 
throughout their entire lifetime, using GPS and 
accelerometer recordings at lower resolution 
(fig. S1).

Similar to other large-bodied soaring mi- 
grants (16-19), white storks try to reduce the 
amount of energetically costly flapping flight 
by exploiting their atmospheric surroundings 
(20). When comparing movement activity among 
our tagged juvenile birds of the same flock, we 
found large differences in the amount of costly 
flapping. For each bird, we calculated a quan- 
titative measure of animal activity from tri- 
axial acceleration data (henceforth, flapping 
activity; see the methods) (Fig. 1F) (21, 22). Although
storks flew in close proximity (figs. S2
and S3), flapping activity ranged from 0.8 to 
1.8. Thus, to cover the same distance during the 
same time, some individuals performed con- 
siderably more flapping flight than did others. 
Flapping activity was not influenced by indi- 
vidual features (e.g., body measures or sex) 
or conditions before fledging (general linear 
model, Fig = 0.798, P = 0.671; table S2). Within- 
individual differences in flapping activity were 
stable across the different migration days 
(table S3). 
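
As a rough illustration of how an activity measure can be derived from one triaxial acceleration burst, the sketch below computes overall dynamic body acceleration (ODBA), the quantity the Fig. 1 caption names as the basis of flapping activity. It is a minimal sketch under stated assumptions, not the authors' pipeline: the function name `odba` is hypothetical, the static (gravity) component is approximated by the per-axis burst mean, and the daily averaging and standardization mentioned in the caption are not reproduced.

```python
from statistics import mean


def odba(ax, ay, az):
    """Overall dynamic body acceleration for one burst.

    Assumption: the static component of each axis is approximated by the
    mean of that axis over the burst; the dynamic component is what remains.
    """
    if not ax or not (len(ax) == len(ay) == len(az)):
        raise ValueError("need three non-empty axes of equal length")
    sx, sy, sz = mean(ax), mean(ay), mean(az)
    # Sum the absolute dynamic components per sample, then average over the burst.
    return mean(abs(x - sx) + abs(y - sy) + abs(z - sz)
                for x, y, z in zip(ax, ay, az))
```

A burst in which only one axis oscillates contributes that axis's mean absolute deviation; a perfectly still sensor yields zero.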

First, we examined how these differences 
in flapping activity relate to birds’ positions 
within the group. Exploring group structure 
in detail is challenging because of the different
flight modes of soaring migrants (23, 24).
To examine flock organization during all flight 
modes, we developed a metric that quantifies 
time advances or delays (Δt) between each
pair of birds, allowing us to measure the time
that separates two individuals, i.e., how much
time a bird needs to reach the current location
of the other bird (figs. S4 and S5). Storks
with low flapping activity flew ahead of other
flock members on average, whereas storks with
high flapping activity flew behind (Pearson’s
r = -0.778, n = 27, P = 1.7 × 10⁻⁶; Fig. 2A). Next,
we found that an individual’s position within
the flock (Δt) correlated with leadership (25)
during the gliding segments (Pearson’s r =
0.846, n = 27, P = 2.7 × 10⁻⁸; Fig. 2B). Further,
following birds tended to have higher flapping
activity than did leaders (Pearson’s r = -0.770,
n = 27, P = 2.6 × 10⁻⁶; figs. S6 and S7). Because
leader and follower roles are respectively re- 
flected in the front and back positions in the 
flock, we refer to birds that are ahead of the 
flock on average as leaders and those behind 
as followers (supplementary text and figs. S8 
and S9). 
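
The idea of a pairwise time advance or delay can be sketched in a few lines. The code below is only an illustration of the description above (how long one bird needs to reach the other's current location) and is not the metric defined in the supplementary material: the function name `time_delay`, the nearest-approach criterion, the planar coordinates, and the search window `max_lag` are all assumptions made for this example. Tracks are taken to be time-synchronized lists of (x, y) positions at a fixed sampling rate.

```python
import math


def time_delay(track_a, track_b, i, max_lag):
    """Lag (in samples) at which bird B passes closest to A's position at sample i.

    A positive lag means B gets there later, i.e. A is ahead of B.
    Hypothetical nearest-approach definition, used here for illustration only.
    """
    ax, ay = track_a[i]
    best_lag, best_dist = 0, math.inf
    for lag in range(-max_lag, max_lag + 1):
        j = i + lag
        if 0 <= j < len(track_b):
            bx, by = track_b[j]
            dist = math.hypot(bx - ax, by - ay)
            if dist < best_dist:
                best_lag, best_dist = lag, dist
    return best_lag
```

With 1-Hz GPS bursts the returned lag is directly in seconds; averaging it over all samples and partners would give one value per bird, positive for birds that tend to fly ahead.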

Followers not only spent considerably more 
time flapping their wings, but also spent less 
time thermalling than did leaders (Pearson’s r =
-0.688, n = 27, P = 7.2 × 10⁻⁵; Fig. 2C). Followers
finished thermalling earlier, at a lower altitude, 
likely to avoid being isolated from others—thus 
seemingly failing to exploit the full potential of 
thermals (fig. S10). In addition, followers flew 
farther behind, and at lower altitudes, than lead- 
ers during glides (Fig. 2D and figs. S11 to S13). 
Given that the tagged juveniles migrated to- 
gether with untagged storks, it is likely that the 
motion of the observed leaders was in fact af- 
fected by other, possibly more experienced, adult 
birds. Juveniles have higher flight costs than 
adults, but their ability to use thermals effec- 
tively improves throughout their journey (26). 
Collective movements may also partly arise from 
identical reactions to the same environmental 
features, but in this study we cannot distinguish 
between responses to environmental and social 
cues (27). 
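
The correlation coefficients, sample sizes, and P values reported above are linked by a standard transform, which the following sketch makes explicit. It assumes a two-tailed test of zero correlation with n - 2 degrees of freedom (the text does not state which test was used), and it integrates the Student t density numerically with Simpson's rule so that only the standard library is needed. The function name `pearson_p_value` is hypothetical.

```python
import math


def pearson_p_value(r, n, steps=20000):
    """Return (t, p) for a two-tailed test that the true correlation is zero.

    Assumes the usual transform t = |r| * sqrt((n - 2) / (1 - r^2)).
    """
    if n < 3 or not -1.0 < r < 1.0:
        raise ValueError("need n >= 3 and -1 < r < 1")
    if steps % 2:
        steps += 1  # Simpson's rule needs an even number of intervals
    df = n - 2
    t = abs(r) * math.sqrt(df / (1.0 - r * r))

    # Log of the Student t normalizing constant.
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))

    def pdf(x):
        return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

    # Simpson's rule for P(0 < T < t); the two-tailed p-value is 1 - 2 * that.
    h = t / steps
    total = pdf(0.0) + pdf(t)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * pdf(k * h)
    central = total * h / 3.0
    return t, max(0.0, 1.0 - 2.0 * central)
```

Calling `pearson_p_value(-0.688, 27)` returns the t statistic and the corresponding two-tailed probability for the thermalling-time correlation; the same call with other (r, n) pairs from the text works identically.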

Leaders and followers differed in their path 
“tortuosity” while flying within the thermals. 
Leading birds showed irregular circling while 
thermalling (calculated as the absolute value
of the time derivative of the horizontal curvature,
|dκ/dt|), demonstrating that they make
considerable adjustments to their flight paths,
consistent with a need to locate the center of
the complex thermal structures. In contrast,
followers circled more regularly, indicating
that, as theory has suggested (24), followers
can benefit from social information to reach
the center of thermals; Δt correlates highly with
|dκ/dt| for individual averages (Pearson’s r =
0.570, n = 27, P = 0.002; Fig. 3A). Because every
bird spent some time in the front and back half 
of the flock, we determined each individual’s 
thermalling performance when ahead and 
behind the center of the flock (Fig. 3B). Almost all exhibited more regular circling and
faster climb rates in thermals when following others than when flying ahead (paired t test,
n = 22, P = 0.030 and 0.018, respectively; Fig. 3C and figs. S13 and S14).

Fig. 1. Collective migration on different spatial scales, recorded using accelerometers
and high-resolution GPS. (A to E) Social interactions during migration (A) shape the global
migratory route (B) by influencing small-scale navigational decisions [flight behavior
between (C) and within (D) thermals] and individual flight performance (E). Arrow color
and size in (D) represent coarse-grained local air velocities estimated from the birds’
tracks (29). ACC, accelerometer. (F) Sample of triaxial accelerometer data used to
calculate flapping activity, defined as the standardized mean of daily overall dynamic
body acceleration. Plots show data for the birds with the lowest (s01; top) and highest
(s27; bottom) flapping activity. (G) Five flock trajectories (1-Hz GPS bursts) of migrating
storks during thermalling and gliding flight. Bursts are shifted by 1 km for visualization.
Gray arrows indicate flight direction. Filled circles show the positions of all individuals
at 2 min. Track color corresponds to flapping activity (FA). (H and I) Enlarged view of the
tracks marked in (G). The third area marked in (G) is shown in fig. S4.

tive position of followers in a spherical coordinate system during gliding. We placed the
focal bird (dark gray; one of the five birds with the lowest flapping activity) at the
center of the coordinate system and measured the relative position of another bird (light
gray; one of the five birds with the highest flapping activity) using the polar (θ) and
elevation angle (φ) measured from the focal bird's horizontal flight direction. The
probability density function (PDF) shows the relative locations of these birds when both
were gliding and Δt ∈ (2.5 s, 7.5 s).

Fig. 3. Derivative of curvature during thermalling flight for leaders and followers.
(A) Relationship between the absolute value of the time derivative of horizontal path
curvature (|dκ/dt|) and Δt, averaged for each individual across the first migration day.
Error bars, SEM. (B) |dκ/dt|, calculated using Δt as 1-s (crosses) and 10-s (bars) bins.
Error bars represent the standard deviation of the mean (two-tailed t test, **P < 0.01,
n = 27). (C) Relationship between |dκ/dt| while flying ahead of (Δt > 0) and behind
(Δt < 0) the average of the flock. Line, y = x. Error bars, SEM. (D) Example trajectories
for illustrating the derivative of curvature. Of all bursts that contained thermalling and
had a small wind drift, we chose one random example and depicted the most leading (blue)
and following (red) individuals, identified on the basis of their Δt (highest and lowest
value of that burst, respectively). (E) Horizontal (top) and vertical (bottom) movement
components of these tracks. (F and G) Tracks color-coded to show Δt (F) and |dκ/dt| (G)
(s15 is shifted down for better visualization). |dκ/dt| is also indicated by line width.
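
To make the circling-regularity measure concrete, the sketch below estimates horizontal path curvature from evenly sampled positions by central differences and then averages the absolute change in curvature per unit time. It is a minimal sketch under stated assumptions rather than the authors' implementation: positions are planar x, y coordinates in meters, the sampling interval `dt` is constant, no smoothing or wind-drift correction is applied, and both function names are hypothetical.

```python
def curvature(xs, ys, dt=1.0):
    """Signed horizontal curvature (1/m) at interior samples, by central differences."""
    kappa = []
    for i in range(1, len(xs) - 1):
        dx = (xs[i + 1] - xs[i - 1]) / (2 * dt)
        dy = (ys[i + 1] - ys[i - 1]) / (2 * dt)
        ddx = (xs[i + 1] - 2 * xs[i] + xs[i - 1]) / dt ** 2
        ddy = (ys[i + 1] - 2 * ys[i] + ys[i - 1]) / dt ** 2
        speed_sq = dx * dx + dy * dy
        # kappa = (x' y'' - y' x'') / |v|^3; define it as 0 for a stationary point.
        kappa.append((dx * ddy - dy * ddx) / speed_sq ** 1.5 if speed_sq > 0 else 0.0)
    return kappa


def mean_abs_curvature_rate(xs, ys, dt=1.0):
    """Mean of |d(kappa)/dt| along the track; near zero for a perfectly regular circle."""
    kappa = curvature(xs, ys, dt)
    if len(kappa) < 2:
        raise ValueError("need at least four positions")
    rates = [abs(kappa[i + 1] - kappa[i]) / dt for i in range(len(kappa) - 1)]
    return sum(rates) / len(rates)
```

A track that circles at constant radius and speed gives a value close to zero, whereas a bird that repeatedly tightens and widens its circles gives a larger one, which is the contrast drawn between followers and leaders.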

Examining the complete migratory paths 
of the 27 birds (at lower temporal resolution) 
revealed considerable differences in migratory 
distance, with some birds remaining within 
Europe and others traveling several thousand 
kilometers to Africa (Fig. 4). Migratory dis- 
tance was strongly correlated with the birds’ 
migratory flight behavior; birds that exhibited 
a high proportion of (costly) flapping activ- 
ity migrated less far than birds that occupied 
frontal positions and exhibited low flapping 
activity when within the flock (Pearson’s r =
-0.66, n = 20, P = 0.001; Fig. 4, inset). These
differences in long-term migration behaviors 
can be predicted using only a few minutes of 
movement data from the flock’s first migra- 
tion day (supplementary text and fig. S15). Fur- 
thermore, flight time before migration (i.e., 
total number of GPS bursts in which each bird 
was found to be flying, before migrating) was 
also highly correlated with flapping activity 
(Pearson’s r = -0.648, n = 27, P = 2.6 × 10⁻⁴;
fig. S16) and migratory distance (Pearson’s r =
0.619, n = 20, P = 0.004). The differences in
flight performance between leaders and fol- 
lowers suggest that juvenile storks may differ 
in their aerodynamic features and/or their 
behavioral strategies, which may affect their 
migration and group behavior over multiple 
scales. Nevertheless, birds can compensate for their inferior flight skills [e.g., lower
glide ratio (ratio of forward speed to sink speed) and more flapping flight] by following
others, which enables them to rise faster within thermals (figs. S13 and S14).

Fig. 4. Relationship between migratory distance and flapping activity. Migration routes
of storks during the first 4 weeks of migration. Tracks are color-coded on the basis of the
overall flapping activity measured during the high-resolution data acquisition period
(black dashed rectangle). The inset shows the relationship between flapping activity and
maximum distance reached within 4 weeks. Color corresponds to flapping activity; open
circles show birds that died within the first 4 weeks.

Unlike storks, which form large groups with 
spatiotemporally dynamic structures, other spe- 
cies have been suggested to improve social in- 
formation usage by flying in V-formation (28). 
Although the number of studies that use ad- 
vanced tracking technologies to examine col- 
lective migration is increasing (3, 4, 29), the 
consequences of social behavior and social or- 
ganization are still largely unknown, especially 
in wild, freely moving animals. We identified 
two different behavioral strategies in a flock of 
migrating white storks, a finding that agrees 
with theoretical predictions (2). We unraveled 
mechanisms of collective migration in a natural 
environment by showing how local-scale leader- 
follower strategies emerge through a differential 
exploitation of the atmosphere. We suggest that 
integrating intraspecific interactions into the 
study of animal movements will enable a better, 
more mechanistic understanding of broad-scale 
ecological processes. 


REFERENCES AND NOTES 


1. E. J. Milner-Gulland, J. M. Fryxell, A. R. E. Sinclair, Animal Migration: A Synthesis (Oxford Univ. Press, 2011).
2. V. Guttal, I. D. Couzin, Proc. Natl. Acad. Sci. U.S.A. 107, 16172-16177 (2010).
3. S. J. Portugal et al., Nature 505, 399-402 (2014).
4. T. Mueller, R. B. O'Hara, S. J. Converse, R. P. Urbanek, W. F. Fagan, Science 341, 999-1002 (2013).
5. N. Chernetsov, P. Berthold, U. Querner, J. Exp. Biol. 207, 937-943 (2004).
6. A. Berdahl, C. J. Torney, C. C. Ioannou, J. J. Faria, I. D. Couzin, Science 339, 574-576 (2013).
7. J. W. Jolles, N. J. Boogert, V. H. Sridhar, I. D. Couzin, A. Manica, Curr. Biol. 27, 2862-2868.e7 (2017).
8. I. D. Couzin, J. Krause, N. R. Franks, S. A. Levin, Nature 433, 513-516 (2005).
9. A. Strandburg-Peshkin, D. R. Farine, I. D. Couzin, M. C. Crofoot, Science 348, 1358-1361 (2015).
10. A. Flack, B. Pettit, R. Freeman, T. Guilford, D. Biro, Anim. Behav. 83, 703-709 (2012).
11. A. J. King, D. D. P. Johnson, M. Van Vugt, Curr. Biol. 19, R911-R916 (2009).
12. A. M. Simons, Trends Ecol. Evol. 19, 453-455 (2004).
13. E. A. Codling, J. W. Pitchford, S. D. Simpson, Ecology 88, 1864-1870 (2007).
14. C. Torney, Z. Neufeld, I. D. Couzin, Proc. Natl. Acad. Sci. U.S.A. 106, 22055-22060 (2009).
15. R. Kays, M. C. Crofoot, W. Jetz, M. Wikelski, Science 348, aaa2478 (2015).
16. H. Weimerskirch, C. Bishop, T. Jeanniard-du-Dot, A. Prudor, G. Sachs, Science 353, 74-78 (2016).
17. S. Sherub, G. Bohrer, M. Wikelski, R. Weinzierl, Biol. Lett. 12, 20160432 (2016).
18. G. Bohrer et al., Ecol. Lett. 15, 96-103 (2012).
19. C. M. Bishop et al., Science 347, 250-254 (2015).
20. A. Hedenström, Philos. Trans. R. Soc. London B Biol. Sci. 342, 353-361 (1993).
21. A. C. Gleiss, R. P. Wilson, E. L. C. Shepard, Methods Ecol. Evol. 2, 23-33 (2011).
22. A. Flack et al., Sci. Adv. 2, e1500931 (2016).
23. C. J. Pennycuick, Ibis 111, 525-556 (1969).
24. Z. Ákos, M. Nagy, T. Vicsek, Proc. Natl. Acad. Sci. U.S.A. 105, 4139-4143 (2008).
25. M. Nagy, Z. Ákos, D. Biro, T. Vicsek, Nature 464, 890-893 (2010).
26. S. Rotics et al., J. Anim. Ecol. 85, 938-947 (2016).
27. N. W. Bode et al., Am. Nat. 179, 621-632 (2012).
28. B. Voelkl, J. Fritz, Philos. Trans. R. Soc. London B Biol. Sci. 372, 20160235 (2017).
29. M. Nagy, I. D. Couzin, W. Fiedler, M. Wikelski, A. Flack, Philos. Trans. R. Soc. London B Biol. Sci. 373, 20170011 (2018).

Flack et al., Science 360, 911-914 (2018) 25 May 2018


ACKNOWLEDGMENTS 


We thank all the people who helped during fieldwork, especially
W. Schafle, R. van Noordwijk, B. Kranstauber, D. Piechowski, B. Eid,
and Y. Flack. We thank W. Heidrich and F. Kümmeth (e-obs,
Munich, Germany) for their suggestions on logger programming
and S. Davidson for setting up the Movebank Data Repository.
Funding: We acknowledge funding from the Max Planck Institute
for Ornithology. A.F. was supported by the German Aerospace
Center (DLR) and the Christiane Nüsslein-Volhard Stiftung. M.N.
received funding from the Royal Society Newton Alumni scheme.
I.D.C. acknowledges support from the NSF (IOS-1355061), the ONR
(N00014-09-1-1074 and N00014-14-1-0635), the ARO (W911NG-11-1-0385
and W911NF14-1-0431), the Struktur- und Innovationsfunds für
die Forschung of the State of Baden-Württemberg, and the Max
Planck Society. Author contributions: A.F. and M.W. conceived the
idea and designed the project; A.F., W.F., and M.W. conducted
fieldwork and collected the data; M.N. designed the collective
trajectory analyses and visualizations; A.F. and M.N. designed the
data analyses and analyzed the data with contributions from I.D.C.;
and A.F. and M.N. wrote the text with contributions from I.D.C.
Competing interests: The authors declare that they have no
competing interests. Data and materials availability: The data
that were used for this study are deposited at the Movebank
Data Repository (https://www.movebank.org/node/15294;
DOI, 10.5441/001/1.bj96m274).


SUPPLEMENTARY MATERIALS 
www.sciencemag.org/content/360/6391/911/suppl/DC1
Materials and Methods
Supplementary Text
Figs. S1 to S16
Tables S1 to S3
Movies S1 to S3


25 August 2017; accepted 17 April 2018 
10.1126/science.aap7781 



RESEARCH 


SYNTHETIC BIOLOGY 


An ingestible bacterial-electronic system to monitor

Biomolecular monitoring in the gastrointestinal tract could offer rapid, precise disease
detection and management but is impeded by access to the remote and complex
environment. Here, we present an ingestible micro-bio-electronic device (IMBED) for in situ
biomolecular detection based on environmentally resilient biosensor bacteria and
miniaturized luminescence readout electronics that wirelessly communicate with an
external device. As a proof of concept, we engineer heme-sensitive probiotic biosensors
and demonstrate accurate diagnosis of gastrointestinal bleeding in swine. Additionally,
we integrate alternative biosensors to demonstrate modularity and extensibility
of the detection platform. IMBEDs enable new opportunities for gastrointestinal
biomarker discovery and could transform the management and diagnosis of
gastrointestinal disease.


Microorganisms living on and in the human body constantly interrogate their biochemical
surroundings and alter gene expression to adapt to changing environments. Synthetic
biology enables the robust engineering of living cells with increasingly complex genetic
circuits to sense biological inputs and control gene expression (1). Whole-cell biosensors
harness this sensing ability to detect analytes associated with human health (2) or
environmental contamination (3). Owing to their innate robust functionality in complex
physiological environments, biosensors have been developed to sense clinically relevant
biomarkers in serum or urine ex vivo (4), as well as gut biomolecules supplemented in
diet (5-7) or generated during disease (8-10). However, despite their promise as
noninvasive diagnostics, biosensors have yet to be employed for clinically compatible
testing in an unobtrusive, real-time, and user-friendly way. Current research applications
of ingestible biosensors in animal models rely on cumbersome analysis of bacterial gene
expression or DNA in stool samples (5-10), rather than real-time reporting from within
the body.

By contrast, the impressive scaling of semi- 
conductor microelectronics over the past few 
decades has delivered sophisticated, highly miniaturized
platforms for ultra-low-power sensing,
computation, and wireless communication (11-13).
In turn, these have enabled the recording of patient
compliance and the evaluation of the gastrointestinal
tract by using optical images, gases,
temperature, and pH (14-17). However, the ability
of electronics to directly and selectively sense
biomolecules in vivo is limited by the availabil- 
ity of labile biochemical transducers and the 
size of the power-demanding circuits required 
to sense them. 

Here, we describe an ingestible micro-bio- 
electronic device (IMBED) that combines engi- 
neered probiotic sensor bacteria together with 
ultra-low-power microelectronics to enable in 
situ detection of gastrointestinal biomolecules 
associated with health or disease (fig. S1). By 
partitioning sensing to biological systems and 
computation and communication to electrical 
devices, IMBEDs leverage the natural advantages 
of each approach to enable ingestible gastro- 
intestinal diagnostics. In an IMBED, biosensor 
probiotics lie adjacent to readout electronics in 
individual wells separated from the outside en- 
vironment by a semipermeable membrane that 
confines cells in the device and allows for dif- 
fusion of small molecules. Sensing of target bio- 
markers by the bacteria generates light, which 
is detected by photodetectors embedded in 
the electronics. These electrical signals are processed
by an integrated bioluminescence detection
circuit (18) and are transmitted wirelessly
from the device to an external radio or cellular
phone for convenient readout.

As a proof-of-concept IMBED for a clinically 
relevant biomarker, we developed a biosensor 
for gastrointestinal bleeding events via heme 
liberated from lysed red blood cells. Although 
cost-effective fecal occult-blood testing is avail- 
able (19), diagnosis of acute bleeding in the upper 
gastrointestinal tract often requires endoscopic 
observation (20), and IMBEDs could offer a rap- 
id, minimally invasive, and cost-effective means 
of detection. The heme biosensor was based on a synthetic promoter (P_L(HrtO)) (fig. S2A),
regulated by the Lactococcus lactis heme-responsive transcriptional repressor, HrtR (21),
and ChuA, an outer-membrane transporter from Escherichia coli O157:H7 that allows for the
transit of extracellular heme through the cell envelope (Fig. 1A) (22). Photorhabdus
luminescens luxCDABE was used as the output of the genetic circuit as it functions at body
temperature and encodes all components necessary for intracellular substrate production
(23). The resultant prototype biosensor in laboratory E. coli (MG1655 V1) responded to
increasing heme input with luminescence output with a signal-to-noise ratio (SNR) of 5.9
and a Hill function threshold (K_D) of 1 µM heme (fig. S2B). Luminescence production was
also induced by whole horse blood (fig. S2C), and lysis of red blood cells in simulated
gastric fluid greatly improved sensitivity by liberating heme [K_D = 115 parts per million
(ppm) blood] (Fig. 1B and fig. S2D). Next, the prototype genetic circuit was iteratively
optimized with the goal of improving SNR without compromising maximum luminescence output
(MG1655 V2; Fig. 1B and fig. S3). The final gene circuit was transferred to the probiotic
E. coli Nissle 1917 (Nissle V2), and the resultant strain responded rapidly to lysed horse
blood (SNR = 310; K_D = 95 ppm; t_1/2 = 45 min) (Fig. 1B and fig. S4) as well as
human blood (fig. S5).
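
The role of the Hill function threshold can be illustrated with a generic dose-response sketch. This is not the fitted model from the study: the function name `hill_response`, the Hill coefficient `n`, and the basal and saturated output levels `low` and `high` are placeholders chosen for the example, and the threshold is taken, as an assumption, to be the analyte concentration that gives half-maximal induction.

```python
def hill_response(conc, k_d, n=1.0, low=1.0, high=100.0):
    """Generic Hill-type dose-response curve.

    conc : analyte concentration (same units as k_d, e.g. ppm blood)
    k_d  : concentration giving half-maximal induction (assumed meaning of the threshold)
    n    : Hill coefficient (hypothetical value; not reported in the text)
    low, high : basal and saturated luminescence (arbitrary units, placeholders)
    """
    if conc < 0 or k_d <= 0:
        raise ValueError("conc must be >= 0 and k_d must be > 0")
    fraction = conc ** n / (k_d ** n + conc ** n)
    return low + (high - low) * fraction
```

With `k_d = 95` (ppm), the curve sits at `low` with no blood, passes the midpoint between `low` and `high` at 95 ppm, and approaches `high` at concentrations well above the threshold.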

To examine functionality of the bacterial blood 
sensor in vivo, we evaluated whether bacterial 
blood sensors passing through the gut could 
detect upper gastrointestinal bleeding elicited 
by oral indomethacin administration (Fig. 1C) (24). 
At baseline, administration of the biosensors 
did not lead to detectable luminescence activity 
in stool, indicating that basal heme concentra- 
tions in the murine gut are insufficient to activate 
the gene circuit (fig. S6). After oral adminis- 
tration of indomethacin, blood sensor bacte- 
ria demonstrated 18-fold higher luminescence 
values in fecal pellets as compared to controls 
(Fig. 1C). Our designed biosensor can thus ef- 
fectively detect the presence of gastrointestinal 
bleeding in vivo. 

Fig. 1. Probiotic E. coli can be engineered to sense blood in vitro and in vivo.
(A) Schematic of the blood sensor gene circuit. Extracellular heme is internalized through
the outer-membrane transporter ChuA and interacts with the transcriptional repressor HrtR
to allow for expression of the bacterial luciferase operon luxCDABE. (B) Dose-response
curves of prototype (V1) and optimized (V2) heme-sensing genetic circuits in laboratory
(MG1655) and probiotic (Nissle) strains of E. coli. Error bars represent SEM of three
independent biological replicates. (C) In vivo blood sensor performance. C57BL/6J mice
were administered indomethacin (10 mg/kg) to induce gastrointestinal bleeding or vehicle
(PBS, phosphate-buffered saline) and inoculated with blood sensor E. coli Nissle cells the
following day. Normalized luminescence values of fecal pellets were significantly higher in
mice administered indomethacin compared to control animals (*P = 0.04; Student's t test;
N = 10 mice). CFU, colony-forming units; RLU, relative luminescence units.

evaluation of IMBED for miniaturized wireless sensing with cellular biosensors. (A) Cross
section, electrical system diagram, and front- and back-side photos of the device. PDMS,
polydimethylsiloxane. (B) System photocurrent response measured without cells. The incident
photon flux was supplied by green light-emitting diode (wavelength λ = 525 nm) and
calibrated with an optical power meter (individual traces shown for N = 3 devices).
(C) Kinetic response of blood sensor IMBED in bacterial growth media supplemented with 0
and 500 ppm blood. (D) Dose-response of blood sensor IMBEDs in bacterial growth media
containing different blood concentrations 2 hours after exposure. The leftmost data point
represents the background response in the absence of blood. (E) Detection of multiple
gut-relevant small molecules with IMBEDs. HrtR-, LuxR-, and ThsRS-containing E. coli Nissle
strains in IMBEDs were exposed to 500 ppm blood, 100 nM acyl-homoserine lactone (AHL), or
10 mM thiosulfate for 2 hours. In (C) to (E), error bars denote the SEM for three
independent biological replicates conducted with different IMBEDs.

We next sought to integrate our bacterial biosensor with an electronic sensor and wireless
transmission platform. Prior demonstrations of sensitive bioluminescence detection
electronics have required external wiring and have been limited to bench-top assays
(18, 25, 26), whereas a low-power wireless solution could enable convenient in vivo
detection. We developed a miniaturized, fully integrated, wireless readout capsule for
targeted sensing of small molecules in the gastrointestinal tract (Fig. 2A). The device
combined our prior nanowatt-level time-based luminometer chip (18) with a microprocessor,
wireless transmitter, and a set of phototransistors inside a molded capsule.
Bioluminescence from activated cells was detected by phototransistors located below each
cavity. The detected luminescence was converted to a digital code by the low-power
luminometer chip and transmitted wirelessly outside the body for calibration, display, and
recording. The small button-cell battery inside the capsule (5 mAh) powered the device,
and the measured power consumption (table S1) suggests a nominal device shelf-life of
>9 months and active operation time of 1.5 months on a full charge. The electronic system
was highly sensitive and captured photon flux as low as 5 × 10⁴ photons/s incident on the
0.29-mm² area of the detectors [white-noise-limited coefficient of variation (CV) 13%,
Fig. 2B and fig. S8A]. The mean channel mismatch was less than 6% (CV) (fig. S7A), and mean
temperature-induced drift across 5°C variation was less than 2 pA (fig. S7B). Additionally,
the electronic system was stable in simulated gastric fluid for up to 36 hours (fig. S7C),
providing sufficient time to perform an ingestible measurement during gastrointestinal
transit.
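
The battery figures quoted above imply an average current draw that is easy to estimate. The sketch below is a back-of-the-envelope calculation only, assuming the full 5-mAh capacity is usable, ignoring self-discharge and voltage droop, and treating a month as 30 days; the function name `average_current_uA` is hypothetical, and the measured values themselves are in table S1.

```python
def average_current_uA(capacity_mAh, days):
    """Average current (in microamperes) that drains `capacity_mAh` in `days` days."""
    if capacity_mAh <= 0 or days <= 0:
        raise ValueError("capacity and duration must be positive")
    hours = days * 24.0
    return capacity_mAh / hours * 1000.0  # mA -> uA


# Assumption: 1.5 months of active operation is about 45 days,
# and a shelf-life of more than 9 months is at least about 270 days.
active_uA = average_current_uA(5.0, 45)
standby_uA = average_current_uA(5.0, 270)
```

Under these assumptions the active-mode budget is a few microamperes on average, and the standby draw must stay below roughly one microampere to reach the stated shelf-life.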


To demonstrate integration of the ingestible 
luminometer capsule and engineered biosensors, 
we tested the probiotic blood sensor strains in 
an IMBED in vitro. Upon exposure to 500 ppm
blood, induced bioluminescence could be observed
in as little as 30 min (Fig. 2C). The dose-response
curve of blood sensor IMBEDs was similar to
that of plate-reader measurements (SNR = 76;
K_D = 135 ppm; compare Fig. 1B and 2D), with
saturation achieved at 250 ppm and significant
detection as low as 32.5 ppm blood (fig. S8)
(Student's t test; P = 0.03). By combining cellular
sensors with ultra-low-power electronic readout, 
IMBEDs serve as a flexible platform for sensitive
detection of bleeding in fluidic environments.
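
A dose-response curve of this kind is commonly summarized with a Hill function. The following minimal sketch assumes that functional form; the Hill coefficient of 2 and the function name are illustrative assumptions, and only the half-maximal constant of 135 ppm is taken from the text.

```python
def hill_response(dose_ppm: float, k_half_ppm: float = 135.0,
                  hill_n: float = 2.0, max_signal: float = 1.0) -> float:
    """Signal predicted by a Hill function at a given blood concentration (ppm).

    k_half_ppm is the dose giving half-maximal signal (135 ppm in the text).
    hill_n is an assumed steepness; it is not reported in this section.
    """
    if dose_ppm <= 0:
        return 0.0
    ratio = (dose_ppm / k_half_ppm) ** hill_n
    return max_signal * ratio / (1.0 + ratio)


# Evaluate the assumed curve at the concentrations mentioned in the text.
for dose in (32.5, 135.0, 250.0, 500.0):
    print(f"{dose:6.1f} ppm -> {hill_response(dose):.3f} of maximum")
```

By construction the function returns half of the maximum at the half-maximal constant, whatever steepness is assumed.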

Fig. 3. IMBEDs can rapidly detect porcine gastric bleeding. (A) Schematic of the experimental flow,
which consisted of blood administration in neutralization solution, capsule deposition, and wireless
transmission to a commercial receiver connected to a laptop or a cellular phone. Representative
endoscopic (B) and x-ray (C) images illustrate the location of the device in the stomach at the
conclusion of our 2-hour experiments, just before device removal [scale bar (C), 5 cm].
(D) Kinetic response of blood sensor IMBED in a porcine model of gastric bleeding. IMBEDs deposited
in gastric cavity can rapidly discriminate between pigs administered blood versus buffer control.
Error bars denote SEM for six IMBED experiments (three animals on different days, two capsules per
animal). *P < 0.05, Student's t test. (E) Receiver operating characteristic (ROC) of IMBED sensing
over time. Perfect detection is achieved at t = 120 min. AUC, area under the curve.


In addition to blood sensing, we adapted 
IMBEDs to sense alternative biomarkers, thio- 
sulfate and acyl-homoserine lactone (AHL). 
Thiosulfate could serve as a biomarker of gut 
inflammation as it is elevated in murine models 
of colitis (8). AHLs are molecular signatures of 
particular bacteria, and their detection could 
indicate the presence of commensal or infectious 
agents in the gut microbiota (27, 28). Thiosulfate- 
and AHL-inducible genetic circuits were intro- 
duced into E. coli Nissle, and exposure to inducer 
elicited a dose-dependent bioluminescence re- 
sponse (fig. S9). These alternative biosensors 
were integrated with IMBEDs, and different 
analytes were readily detectable in a fluidic en- 
vironment (Fig. 2E). As additional biosensors 
of clinically relevant gut biomarkers continue 
to be developed, we anticipate that the breadth 
of potential analytes of the IMBED platform will 
continue to expand. 

To examine wireless in situ detection of bio- 
molecules with biosensors, we deployed blood 
sensor IMBEDs in a porcine model of gastro- 
intestinal bleeding. Prior to device deposition, 
pigs were administered a bicarbonate-glucose 
neutralization solution with or without 0.25 ml 
of blood (Fig. 3A). The neutralization solution 
helped buffer the low pH of the porcine gastric 
fluid as acidic environments degrade the func- 
tionality of the biosensor (fig. S10). The blood 
sensor IMBED was subsequently deposited into 
the stomach via orogastric tube and remained 
resident and stable in the gastric cavity for the
entire duration of the experiment (Fig. 3, B and 
C). Photocurrent data were wirelessly transmitted 
from the stomach over the course of 2 hours and 
logged by both a laptop computer and an Android 
phone equipped with a custom application for 
real-time data processing and visualization (figs. 
S11 and S12). The presence of blood in the por- 
cine gastric environment could be observed as 
early as 52 min (Student’s t test; P < 0.05) and
led to a fivefold increase in photocurrent after 
120 min as compared to animals given buffer 
alone (Fig. 3D and fig. S13). Luminescence 
production was not detected in biosensors lacking 
the ChuA heme transporter or the luciferase 
operon, indicating that observed light produc- 
tion was dependent on a functional genetic cir- 
cuit activated in the presence of heme (fig. S14). 
The receiver operating characteristic of the blood- 
sensing IMBED improved over time, with a sen- 
sitivity and specificity of 83.3% at 60 min and 
100% at 120 min (Fig. 3E). IMBEDs can thus 
detect small amounts of analyte in the harsh 
gastric environment with high specificity and
sensitivity. 
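
Sensitivity, specificity, and the area under the ROC curve can be computed from two groups of readings as sketched below. The photocurrent values are hypothetical placeholders chosen only so that five of six readings in each group fall on the correct side of the threshold (5/6 = 83.3%); they are not data from this study, and the threshold and function names are likewise assumptions.

```python
def sensitivity_specificity(positives, negatives, threshold):
    """Return (sensitivity, specificity) when readings >= threshold are called positive."""
    true_pos = sum(1 for x in positives if x >= threshold)
    true_neg = sum(1 for x in negatives if x < threshold)
    return true_pos / len(positives), true_neg / len(negatives)


def roc_auc(positives, negatives):
    """Area under the ROC curve as the fraction of (positive, negative) pairs
    ranked correctly, counting ties as half."""
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical photocurrent readings (arbitrary units), six capsules per group.
blood = [1.8, 2.4, 3.1, 0.9, 2.7, 2.2]
control = [0.5, 0.7, 1.2, 0.6, 0.8, 0.4]

sens, spec = sensitivity_specificity(blood, control, threshold=1.0)
auc = roc_auc(blood, control)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} AUC={auc:.3f}")
```

With fully separated groups the same functions return a sensitivity, specificity, and AUC of 1, which corresponds to the perfect detection described at 120 min.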

By combining the environmental resilience 
and natural sensing properties of bacterial cells 
with the complex data processing and wireless 
transmission afforded by ultra-low-power micro- 
electronics, we developed a device capable of in 
vivo biosensing in harsh, difficult-to-access environments.
Using gastrointestinal bleeding as a
proof-of-concept model system, we demonstrate
strategies for genetic circuit design and optimi- 
zation, fabrication of an ingestible low-power, 
wireless luminometer, and validation of inte- 
grated system functionality both in vitro and 
in a large animal model. As the field of whole- 
cell biosensors matures, newly developed sen- 
sors of clinically relevant biomarkers could 
be rapidly integrated into an IMBED to per- 
form minimally invasive detection in the gastro- 
intestinal tract. With a test panel of candidate 
biomolecules, IMBEDs could enable studies 
of the biochemistry of anatomical regions that 
are traditionally difficult to access and could 
lead to the discovery of new clinical biomark- 
ers associated with health or disease. The in 
situ detection afforded by IMBEDs could also 
allow sensing of labile gut or microbiota-derived 
biomolecules that would otherwise be degraded 
before excretion in stool. Further integration 
of electronic modules, such as photodetectors, 
microprocessor, and transmitter, in a single 
integrated circuit could allow for further mini- 
aturization of IMBEDs as well as lower power 
consumption. Additional measurement chan- 
nels would also enable more precise biochemical 
readings, as the response of replicate biosen- 
sors within the same device could be averaged 
to mitigate the inherent variance of biological 
sensors as well as the heterogeneity of the com- 
plex gastrointestinal environment. Improved 
preparation of bacterial cultures for long-term 
storage, such as lyophilization, could be imple- 
mented to extend the shelf-life of fully assem- 
bled IMBEDs. Furthermore, the devices could 
be equipped with new orally delivered encap- 
sulation technologies to enable long-term resi- 
dency, monitoring, and anatomic localization 
in the gastrointestinal tract (29, 30). This inte- 
gration of biological engineering and semi- 
conductor electronics offers opportunities to 
transform diagnosis, management, and monitor- 
ing of health and disease. 

RNA buffers the phase separation behavior of prion-like RNA binding proteins

Shovamayee Maharana, Jie Wang, Dimitrios K. Papadopoulos, Doris Richter,
Andrey Pozniakovsky, Ina Poser, Marc Bickle, Sandra Rizk, Jordina Guillén-Boixet,
Titus M. Franzmann, Marcus Jahnel, Lara Marrone, Young-Tae Chang,
Jared Sterneckert, Pavel Tomancak, Anthony A. Hyman, Simon Alberti


Prion-like RNA binding proteins (RBPs) such as TDP43 and FUS are largely soluble in the 
nucleus but form solid pathological aggregates when mislocalized to the cytoplasm. What 
keeps these proteins soluble in the nucleus and promotes aggregation in the cytoplasm is 
still unknown. We report here that RNA critically regulates the phase behavior of prion-like RBPs. 
Low RNA/protein ratios promote phase separation into liquid droplets, whereas high ratios 
prevent droplet formation in vitro. Reduction of nuclear RNA levels or genetic ablation of RNA 
binding causes excessive phase separation and the formation of cytotoxic solid-like assemblies 
in cells. We propose that the nucleus is a buffered system in which high RNA concentrations 
keep RBPs soluble. Changes in RNA levels or RNA binding abilities of RBPs cause aberrant
phase transitions.


The intracellular environment is organized
into membraneless compartments that have
been termed biomolecular condensates
because they form by liquid-liquid phase
separation (1, 2). These condensates often
contain RNA binding proteins (RBPs) with dis- 
tinctive domains, so-called prion-like domains, 
which are structurally disordered and contain 
polar amino acids (3) (Fig. 1A). Interactions be- 
tween prion-like domains and additional inter- 
actions between RNAs and RNA binding domains 
drive the assembly of prion-like RBPs by phase 
separation (4, 5). However, several prion-like 
RBPs, such as FUS, TDP43, and hnRNPA1, can
also undergo an aberrant transition from a liquid- 
like state into solid aggregates that has been 
linked to neurodegenerative diseases such as 
amyotrophic lateral sclerosis (ALS) (4-6). One 
important aspect of these diseases is that ag- 
gregate formation is strongly associated with the 
subcellular location of the proteins. Aggregates 
in patient neurons are usually found in the cyto- 
plasm, whereas the nucleus is usually devoid of 
the aggregating proteins (7-10), although there 
are some noteworthy exceptions (11). Disease-causing
mutations frequently affect the nuclear
partitioning of prion-like RBPs (12, 13), highlight- 
ing the importance of cytoplasmic mislocalization 
in disease. Protein mislocalization to the cytoplasm 
causes loss of function for the nucleus and gain 
of function in the cytoplasm, phenotypes that are 
thought to underlie disease (14-17). Importantly, 
genetic relocalization of FUS to the nucleus in 
yeast strongly decreases FUS toxicity (18). This 
suggests that the localization of FUS to the nu- 
clear environment suppresses its pathological be- 
havior, which raises two important questions: 
What prevents prion-like RBPs from forming solid- 
like aggregates in the nucleus? And why do these 
RBPs form aggregates in the cytoplasm? 

To answer these questions, we investigated 
the phase behavior of several prion-like RBPs 
(Fig. 1A). First, we determined the nuclear con- 
centrations of these proteins. The values ranged 
from 0.2 µM for TAF15 to 42.3 µM for hnRNPA1
(Fig. 1, B to D, and supplementary methods). 
Next, we purified these proteins as green fluo- 
rescent protein (GFP) fusions and added them 
to a physiological buffer. At a concentration similar
to the nuclear concentration (7.6 µM), FUS
phase-separated into droplets (Fig. 1, E and F). 
This behavior contrasted with that in living cells, 
where only 1% of the nuclear FUS protein was 
contained in condensates (Fig. 1F), which are 
paraspeckles (19). The remaining 99% of nu- 
clear FUS protein was diffusely localized. Sim- 
ilar observations were made for TDP43, EWSR1, 
TAF15, and hnRNPA1 (Fig. 1G, bottom panels).
These results suggest that although the protein 
concentration is high enough for phase separa- 
tion in the nucleus, an additional nuclear factor 
prevents phase separation. 

We hypothesized that nuclear RNA could reg- 
ulate the phase behavior of prion-like RBPs. To 
test this idea, we performed an in vitro phase
separation assay with FUS in the presence of total 
RNA (Fig. 2A). In agreement with previous work 
(20-22), we found that small amounts of RNA 
promoted liquid droplet formation (Fig. 2B and 
fig. S1, A to D). RNA-containing droplets con- 
tained a higher FUS concentration than RNA- 
free droplets, and they appeared slightly more 
viscous (fig. S2, A to C). However, upon further 
increase in the RNA/protein concentration ratio, 
the droplets became smaller and finally dissolved 
(Fig. 2, A and B, and fig. S3). The addition of 
RNase A resulted in droplet reappearance (Fig. 
2D and figs. S4A, panels on the right, and S5), 
indicating that droplet solubilization depends 
on intact RNA. Similar results were obtained 
for EWSR1, TAF15, hnRNPA1, and TDP43 (Fig.
2C). Thus, we conclude that high RNA/protein 
ratios prevent phase separation and that low 
ratios promote phase separation. 

We next tested whether different types of RNAs 
differ with respect to their abilities to dissolve FUS 
droplets. Individually, ribosomal RNA, tRNA, and 
a noncoding RNA that is known to bind to FUS 
(Neat1) were all able to solubilize FUS droplets,
suggesting a general effect, but smaller RNAs 
were more potent than larger ones (fig. S4, A to 
D). Secondary structure was important for en- 
riching FUS in droplets, consistent with results 
in previous work (20), but secondary structure 
(fig. S4, A to E) and binding affinity (fig. S6) 
affected droplet solubilization only slightly. We 
next asked whether the cellular RNA concentra- 
tion is high enough to suppress phase separation 
of FUS. We estimated that the nuclear RNA 
concentration is ~10.6 times as high as that re- 
quired for droplet dissolution in vitro (fig. S7 
and supplementary methods). However, ~1% of 
nuclear FUS formed condensates (paraspeckles) 
(Fig. 1E) by binding to the noncoding RNA Neat1 
(19). To test whether Neat1 could nucleate FUS
droplets in the presence of a high background 
concentration of RNA, we added Neat1 RNA to 
a FUS sample that had been solubilized with 
tRNA. This led to a reappearance of FUS droplets 
(Fig. 2E and fig. S4F). We attribute this result to 
the ability of Neat1 to form large RNA assemblies 
(fig. S4C), which subsequently recruit FUS. This 
observation suggests that highly structured RNAs 
such as Neat1 act as scaffolds that promote the
nucleation of condensates in the high-RNA con- 
centration environment of the nucleus. A similar 
scenario may apply for stress granules in the 
cytoplasm, which contain large amounts of struc- 
tured polyadenylated mRNA (fig. S8). 

To test experimentally whether the high nuclear 
RNA concentration keeps FUS soluble, we micro- 
injected ribonuclease A (RNase A) into the nuclei 
of HeLa cells. Immediately after RNase A injection, 
FUS-GFP condensed into many liquid-like droplets 
(Fig. 3A, fig. S9, and movie S1), and this effect was 
not due to a general loss of nuclear integrity (figs. 
S10 and S11). As an alternative approach to de- 
crease the RNA/protein ratio, we injected purified 
FUS-GFP into the nucleus, which led to an imme- 
diate increase in the number and size of nuclear 
FUS assemblies (fig. S12). RNase A microinjection 

Fig. 1. Prion-like RBPs phase-separate at their physiological concentrations.
(A) Domain structure. PLD, prion-like domain; RRM, RNA recognition
motif; RGG, arginine- and glycine-rich region; ZF, zinc finger; NLS, nuclear
localization sequence. (B) Representative images of immunostained HeLa
cells. Dashed lines indicate the nuclear boundary. Scale bar, 5 µm.
(C) Quantification of the nuclear enrichment of RBPs. Error bars represent SD.
(D) Calculated cellular and nuclear (Nuc) concentrations of RBPs in HeLa cells.


into the nucleus also triggered rapid phase separation
of hnRNPA1, EWSR1, TDP43, and TAF15
(figs. S13 and S14). To investigate whether FUS 
forms complexes with RNA in living cells, we used 
fluorescence correlation spectroscopy (FCS). We 
identified two populations of FUS, one slow moving 
and one fast moving (details are in supplementary 
methods). We estimate that the amount of slow 
FUS in the nucleus is 10 times as high as that in 
the cytoplasm (Fig. 3, B to D; fig. S15, A to E; and 
supplementary methods). The fraction of slow 
FUS in the nucleus was decreased by the muta- 
tion of RNA binding domains in FUS (generating 
variants FUS-mutRRM/ZnF and FUS-mutRGG) 
and was further decreased by the removal of all 
RNA binding domains (generating variant FUS- 
PLD) (Fig. 3, E and F; and figs. S15, F to I, S16, and 
S17). These results indicate that a large fraction of 
nuclear FUS is complexed with RNA. To further in- 
vestigate the solubilizing role of RNA, we performed 
genetic experiments with transfected FUS-GFP- 
encoding plasmids. We observed that the number 
of nuclear FUS assemblies was directly propor- 
tional to the nuclear FUS concentration (Fig. 3G). 
We further found that FUS variants with a weaker 
capacity to bind RNA generally formed a higher 
number of assemblies (Fig. 3, H to J, and figs. S16 
and S17). Thus, reduced RNA binding directly 
affects the solubility and decreases the saturation 
concentration at which FUS phase-separates. 
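
For readers unfamiliar with FCS analysis, an autocorrelation curve from two populations is often described by the standard two-component model for free three-dimensional diffusion, sketched below. This is a generic textbook form that assumes equal molecular brightness for both populations; the fitting procedure actually used here is described in the supplementary methods, and all parameter names and default values are illustrative assumptions.

```python
import math


def fcs_two_component(tau: float, n_particles: float, frac_slow: float,
                      tau_fast: float, tau_slow: float, kappa: float = 5.0) -> float:
    """Two-component 3D free-diffusion autocorrelation G(tau).

    n_particles : mean number of molecules in the focal volume
    frac_slow   : fraction of molecules in the slow population (0 to 1)
    tau_fast    : diffusion time of the fast population
    tau_slow    : diffusion time of the slow population
    kappa       : axial-to-lateral ratio of the focal volume (assumed value)
    """
    def diffusion_term(tau_d: float) -> float:
        lateral = 1.0 + tau / tau_d
        axial = math.sqrt(1.0 + tau / (kappa ** 2 * tau_d))
        return 1.0 / (lateral * axial)

    mix = (1.0 - frac_slow) * diffusion_term(tau_fast) + frac_slow * diffusion_term(tau_slow)
    return mix / n_particles


# A larger slow fraction keeps the curve higher at intermediate lag times.
mostly_fast = fcs_two_component(1e-2, 1.0, 0.2, tau_fast=1e-3, tau_slow=5e-2)
mostly_slow = fcs_two_component(1e-2, 1.0, 0.8, tau_fast=1e-3, tau_slow=5e-2)
print(mostly_fast, mostly_slow)
```

In a fit of this form, the slow fraction is the quantity that would be compared between nucleus and cytoplasm or between FUS variants.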
We showed previously that FUS in vitro ini- 
tially forms liquid-like assemblies, but these ma- 
ture into more solid-like gels and fibrils over 
time (4). These solid-like states are reminiscent 
of pathological aggregates in ALS (8, 9). Thus, 
we next tested whether the addition of RNA 

(E) Left, live HeLa cell nucleus expressing GFP-tagged FUS from a bacterial
artificial chromosome (BAC). Arrows point to paraspeckles. Right, FUS-GFP
phase-separated in vitro at 7.5 µM. Scale bars, 2 µm. (F) Quantification of the
fractions of FUS present in condensed and soluble states in vivo and in vitro.
Error bars represent SD. (G) Top, HeLa cell nuclei expressing GFP-tagged
RBPs from BACs. White arrows indicate condensates. Bottom, purified RBPs
phase-separate at their respective nuclear concentrations. Scale bars, 2 µm.

Fig. 2. RNA regulates the phase behavior of prion-like RBPs. (A) Representative images of purified
FUS-GFP (5 µM) in vitro in the presence of total RNA. (B) Quantification of the fraction of condensed FUS-GFP.
C_in, fraction of total protein in the droplets; C_out, fraction of total protein in the soluble phase outside the
droplets. The value of FUS enrichment in the droplet phase in the absence of RNA was normalized to 1. Error bars
represent SD. (C) In vitro phase separation assay with EWSR1, TAF15, hnRNPA1, or TDP43 in the presence of
total RNA. (D) Addition of RNase A to a sample of FUS-GFP (5 µM) solubilized with 300 ng/µl of total RNA.
(E) Left, FUS-GFP (5 µM) solubilized with 800 ng/µl tRNA in vitro. Right, FUS phase separation triggered by the
addition of 100 ng/µl Neat1 RNA in the presence of tRNA (800 ng/µl). Scale bars in (A), (C), (D), and (E), 2 µm.

prevents the formation of fibrils in vitro. The addition
of RNA kept the droplets in a soluble state,
and fibers were not seen (Fig. 4A). We next inves- 
tigated whether RNA also changes the material 
properties of FUS assemblies in vivo. We set up 
an in vivo aging assay in which we microinjected 
RNase A into HeLa cells and then monitored
the dynamics of the liquid-like drops. After about 
30 min, the FUS drops no longer fused (Fig. 4, B 
and C, and movie S2) but stuck together in large 
clusters, similar to phenotypes seen previously 

regions of FCS measurements. Scale bar, 5 µm. (C) Autocorrelation curves
autocorrelation function, which is normalized to the amplitude of 1 at t = 10 µs.
(D) Quantification of the amount of slow FUS (methods are described in
supplementary materials). Error bars represent SD. **P < 0.01. (E) Autocorrelation
curves obtained from FCS of FUS-GFP variants in the nucleus. wt,

Fig. 4. RNA regulates aberrant liquid-to-solid phase transitions of
prion-like RBPs. (A) In vitro phase-separated Gly156→Glu (G156E) variant
FUS-GFP in the absence or presence of total RNA after 24 hours. Scale bar,
2 µm. (B) FUS-GFP-expressing HeLa cell nucleus after RNase A microinjection.
Scale bar, 1 µm. (C) Montage of FUS-GFP droplets formed after
RNase A microinjection. The droplets fuse in the first 5 min (blue box) but
dissociate after 30 min, resulting in “sticky droplets” (red box). (D) Montage of
FUS-GFP droplets formed in vitro (7 µM). The fusion of freshly formed droplets
is compared with 3-hour-old droplets. Scale bar, 5 µm. (E) Fluorescence
recovery after photobleaching (FRAP) of nuclear FUS-GFP assemblies less
than 5 min (blue box) or more than 30 min (red box) after RNase A microinjection.
(F) FRAP of nuclear FUS-GFP assemblies in HeLa cells after RNA
degradation as shown in (B) and (E) (n > 10 cells). (G to I) FRAP of nuclear
assemblies in HeLa cells expressing full-length FUS [(G) and (I)] or FUS-PLD
(H). The cell in (I) was also treated with F22. Scale bars, 1 µm. (J) Mobile


in vitro (Fig. 4D and movie S3). A change in the 
material properties was also evident from photobleaching
experiments (Fig. 4, E and F, and fig.
S18, A and B). Similar results were obtained for
TDP43, but the transition was much faster (fig.
S19 and movie S4). We next used a genetic approach
to test how RNA binding affects the dynamics
of FUS in vivo. Complete abrogation of
RNA binding resulted in a marked decrease of 
mobile FUS (Fig. 4, G, H, and J, and fig. S18C) and 

fraction of photobleached assemblies in (G) to (I) (n > 15 cells). Error bars
represent SD. (K) Three-dimensional (3D) rendering of FUS-PLD nuclear
assemblies. The insets show aberrant “sticky droplets.” Scale bar, 1 µm.
(L) Time series to track the lifetime of FUS-GFP HeLa cells. H2B-mCherry
was used to detect cell death. Scale bar, 5 µm. (M) Quantification of the
fraction of cells undergoing cell death. Error bars represent SD. *P < 0.05,
**P < 0.01, and ***P < 0.001 in comparison with the wild type.


the formation of sticky droplet clusters (Fig. 4K). 
Lastly, we used a chemical approach with the dye 
F22 to reduce RNA binding (23). In F22-treated 
cells, the fraction of RNA-bound FUS was strongly 
diminished (fig. S20 and supplementary methods), 
and this caused a strong reduction in the mobile
fraction of FUS (Fig. 4, I and J, and fig. S18, E 
and F). Together, these findings show that RNA 
keeps condensates formed by prion-like RBPs 
in a dynamic state and prevents the formation of 
solid assemblies that can cause disease. 
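
The mobile fraction in a photobleaching experiment is conventionally defined as the recovered share of the bleached signal. A minimal sketch of that definition follows; the function name and the normalized intensities in the example are hypothetical, and the analysis pipeline used for Fig. 4 is not specified in this section.

```python
def mobile_fraction(pre_bleach: float, post_bleach: float, plateau: float) -> float:
    """Conventional FRAP mobile fraction.

    pre_bleach  : mean intensity before bleaching
    post_bleach : intensity immediately after bleaching
    plateau     : intensity once recovery has levelled off
    """
    bleached = pre_bleach - post_bleach
    if bleached <= 0:
        raise ValueError("post-bleach intensity must be below pre-bleach intensity")
    return (plateau - post_bleach) / bleached


# Hypothetical normalized intensities for a dynamic and a hardened assembly.
liquid_like = mobile_fraction(pre_bleach=1.0, post_bleach=0.2, plateau=0.9)
solid_like = mobile_fraction(pre_bleach=1.0, post_bleach=0.2, plateau=0.3)
print(f"liquid-like: {liquid_like:.2f}, solid-like: {solid_like:.2f}")
```

A value near 1 indicates that almost all bleached molecules exchange with the surroundings, whereas a value near 0 indicates an immobile, solid-like assembly.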

To investigate how reduced RNA binding af- 
fects cell viability, we transiently transfected HeLa 
cells with wild-type and mutant FUS and moni- 
tored cell survival. Expressing a nuclear variant 
with reduced RNA binding (FUS-mutRRM/ZnF) 
affected the rate of cell death only slightly (Fig. 4, 
L and M, and fig. S16), presumably because the 
high nuclear RNA concentration compensated 
for the genetic defect. However, targeting the
very same variant to the cytoplasm by removing
the nuclear localization sequence (NLS; generating
FUS-mutRRM/ZnFΔNLS) led to a strong increase
in cell death, which was likely caused by
the high propensity of this variant to form solid
aggregates (figs. S21 to S23). Importantly, this
increase was not observed for a cytosolic variant
of FUS with normal RNA binding (FUSΔNLS).
Thus, we conclude that excessive phase separation 
in the cytoplasm owing to low RNA levels induces 
a pathological state that leads to cell death. 

One of the key questions in protein misfolding 
diseases caused by prion-like RBPs is why these 
proteins aggregate in the cytoplasm rather than 
the nucleus. In this study, we have shown that 
this pattern is due in part to different RNA 
concentrations in the cytoplasm and the nucleus. 
More specifically, the higher RNA concentration 
in the nucleus suppresses phase separation of 
prion-like RBPs, and the lower concentration in 
the cytoplasm stimulates phase separation. There- 
fore, by keeping the proteins in the nucleus, the 
cell ensures that they are in a soluble and non- 
toxic state, shuttling them out of the nucleus only 
upon stress. After the removal of stress, the pro- 
teins shuttle back into the nucleus, where they 
are again kept in a soluble and well-mixed state. 
The consequence is that any insult that prolongs 
the stress will tend to increase the propensity for 
aggregation because it prolongs the time that 
these proteins spend in the cytoplasm (fig. S24).

Our data also have important implications for 
the control of phase separation in cells. We find 
that paraspeckles are likely induced by locally
concentrating Neat1 RNA, which has a strong af- 
finity for FUS. Similar phenomena have been seen 
for nucleoli, which depend on local production of 
ribosomal RNA (24). Therefore, at least in the nu- 
cleus, local production of RNAs with high affinity 
for specific RBPs may provide the specificity to 
induce phase separation in a system buffered by 
nonspecifically interacting RNA. Thus, the phase 
behavior of FUS in the nucleus is likely controlled 
by many different types of specific and nonspecific 
RNAs. This situation does not apply to the cyto- 
plasm. There, the RNA concentration is only slightly 
higher than the concentration required to suppress 
phase separation in vitro and there is no buffering 
of phase separation by RNA. This environment re- 
sults in a much higher propensity of FUS to phase- 
separate. However, it also increases the tendency of 
FUS to form cytotoxic solid-like aggregates. Large 
amounts of RNA have been shown to suppress the 
toxicity of prion-like RBPs (25-28). Moreover, there 
are many cases of familial ALS in which mutated 
prion-like RBPs mislocalize to the cytoplasm and 
form cytotoxic aggregates. For example, mutations 
in FUS have been shown to increase its cytoplasmic 
concentration, thus causing the formation of aber- 
rant solid-like aggregates (8, 9, 29-31). We predict 
that local changes in RNA levels or RNA binding 
abilities of proteins are frequent causes of age- 
related protein misfolding diseases. 

mRNA structure determines specificity of a polyQ-driven phase separation

Erin M. Langdon, Yupeng Qiu, Amirhossein Ghanbari Niaki, Grace A. McLaughlin,
Chase A. Weidmann, Therese M. Gerbich, Jean A. Smith, John M. Crutchley,
Christina M. Termini, Kevin M. Weeks, Sua Myong, Amy S. Gladfelter


RNA promotes liquid-liquid phase separation (LLPS) to build membraneless 
compartments in cells. How distinct molecular compositions are established and 
maintained in these liquid compartments is unknown. Here, we report that secondary 
structure allows messenger RNAs (mRNAs) to self-associate and determines whether an 
mRNA is recruited to or excluded from liquid compartments. The polyQ-protein Whi3 
induces conformational changes in RNA structure and generates distinct molecular 
fluctuations depending on the RNA sequence. These data support a model in which 
structure-based, RNA-RNA interactions promote assembly of distinct droplets
and protein-driven, conformational dynamics of the RNA maintain this identity. Thus,
the shape of RNA can promote the formation and coexistence of the diverse array
of RNA-rich liquid compartments found in a single cell.


Formation of non-membrane-bound organelles
through the condensation of macromolecules
is a recently appreciated mechanism
of intracellular organization. These liquid-like
condensates form through liquid-liquid
phase separation (LLPS) and are found in the
cytoplasm and nucleus (1, 2). A fundamental unsolved
problem is how liquid droplets recruit
distinct constituents and retain independent 
identities. RNA can drive LLPS and modulates 
the material properties of droplets (3-6), but it is 
unknown whether RNA controls the identity and 
maintenance of coexisting liquid compartments. 
Here, we show that mRNA secondary structure is 
required for droplet identity through directing 
interactions between mRNAs and RNA binding 
proteins. 

Whi3, a polyQ-containing, RNA binding pro- 
tein first identified in Saccharomyces cerevisiae 
(7), functions in morphogenesis, memory of 
mating, and stress responses, where it forms 
aggregates and associates with RNA-processing 
bodies (8-11). The homolog in the filamentous 
fungus Ashbya gossypii has an RNA recognition 
motif (RRM) and an expanded polyQ tract (fig. 
S1A), and both regions promote self-assembly. 
In vitro, Whi3 polyQ-dependent LLPS is driven 
by specific RNAs encoding regulators of either 
the cell cycle (e.g., the cyclin CLN3) or actin (e.g., 
the formin BNI1 and SPA2) (3). Distinct types
of Whi3 droplets form in Ashbya cells: perinuclear
CLN3 droplets and BNI1 droplets at sites
of polarized growth at cell tips (12, 13) (Fig. 1A 
and movie S1). These two types of droplets have 
different Whi3 levels and Whi3 incorporation 
rates (Fig. 1, B and C), suggesting that they are 
structurally distinct. 

The distinct droplet properties may depend 
on extrinsic features of the local cytosolic micro- 
environment or arise due to different droplet 
constituents. CLN3 and BNI1 mRNAs minimally
colocalized in the cytoplasm by single-molecule 
(sm) RNA fluorescence in situ hybridization 
(smFISH), although they were occasionally coex- 
pressed by the same nucleus (Fig. 1, D and F). 
The lack of colocalization suggests there are in- 
trinsic, compositional differences between drop- 
lets. In contrast, mRNA of the polarity regulator 
SPA2 significantly colocalized with BNI1 mRNAs,
especially at growth sites (Fig. 1, E and F). Thus, 
mRNAs encoding functionally related proteins 
colocalize, whereas functionally unrelated mRNAs 
do not. How can distinct Whi3 binding mRNAs 
segregate to different droplets in a common 
cytoplasm? 

To address this question, we employed a re- 
constitution system to investigate whether mRNA 
sequence was sufficient to generate droplet individuality
(Fig. 2A). In vitro, as in cells, droplets
composed of BNI1 mRNA displayed higher
Whi3 to RNA molar ratios than droplets made
with CLN3 mRNA (fig. S1B). Notably, when CLN3
mRNA was added to Whi3 droplets made with
BNI1 mRNA, CLN3 preferentially assembled into
new droplets, rather than incorporating into
BNI1 droplets (Fig. 2, B and C, and fig. S1C). In
contrast, BNI1 mRNA readily incorporated into
preformed droplets (Fig. 2, B and C). Notably,
SPA2 mRNA incorporated into BNI1 droplets
(Fig. 2, B and C), and CLN3 did not incorporate
into SPA2 droplets (fig. S1D). Thus, as in
cells, cyclin and polarity mRNAs assemble into
distinct and immiscible droplets in vitro, indicating
that droplet identity is encoded by
the mRNA.

mRNA sequences could influence droplet identity
by favoring homotypic or specific heterotypic
interactions between RNA molecules. To
test for specific RNA-RNA interactions, we used
a protein-free system to induce electrostatic-mediated
phase transitions of the mRNA (14),
where all mRNAs were capable of homotypic
assembly into liquid or gel-like droplets (Fig. 2D).
Strikingly, CLN3 mRNAs had minimal colocalization
with BNI1 or SPA2 mRNAs, whereas BNI1
and SPA2 were significantly more colocalized
(Fig. 2, E to G). Thus, sequence-encoded features
of the mRNA can underpin the assembly of distinct,
immiscible structures.

We next investigated which features of the
mRNA sequence generate specificity. An mRNA
with scrambled CLN3 coding sequence (cln3 scr)
with intact Whi3 binding sites formed Whi3
droplets (fig. S1E) but no longer showed specificity
(Fig. 3, A and C). Because the length, nucleotide
composition, and Whi3 binding sites
were identical, we hypothesized that the secondary
structure could promote specificity. CLN3
mRNA heated to 95°C to disrupt secondary structure
also readily incorporated into Whi3-BNI1
droplets (Fig. 3, A and C). Melted CLN3 mRNA
that was slowly refolded (CLN3 refold) showed
significantly less recruitment than melted but
more than native CLN3 (Fig. 3, A and C). Mixing
between melted CLN3 and melted BNI1 occurred
in the presence of Whi3 and in RNA-only reactions,
suggesting that mixing is initiated by
RNA-RNA interactions (fig. S2). Thus, specificity
information in CLN3 mRNA can be eliminated
by disrupting secondary structure.

To identify what features of CLN3 mRNA sec- 
ondary structure promote specificity, we per- 
formed selective 2’-hydroxyl acylation analyzed 
by primer extension and mutational profiling 
(SHAPE-MaP), which identifies highly flexi- 
ble regions in RNA (15), to determine second- 
ary structure changes on native, refolded, and 
scrambled CLN3 mRNA (Fig. 3D and fig. S3, A 
and B). The first 400 nucleotides in the CLN3 
sequence exhibited especially low SHAPE re- 
activity (fig. S3C, purple shaded regions), sug- 
gesting many paired nucleotides and a highly 
folded structure. Refolded CLN3 had a signif- 
icant increase in SHAPE reactivity compared 
with native CLN3 (fig. S3A) (P < 0.001, Wilcoxon 
rank sum test), indicating a transition to a more 
unstructured state (Fig. 3, D and E). Melting 
and refolding thus allow the RNA to sample
different conformations from those formed dur- 
ing transcription. As expected, cln3 scr showed 
a different SHAPE profile with dramatically al- 
tered secondary structure (Fig. 3, D and E, and 
fig. S3B). 
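
The comparison of SHAPE reactivities between native and refolded CLN3 above relies on a Wilcoxon rank sum test. As a rough illustration only, the sketch below implements that test with a normal approximation in plain Python. The reactivity values are invented placeholders, not data from this study, and the function name `rank_sum_test` is hypothetical.

```python
import math

def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank sum test using the normal approximation.

    Returns (z, p). Tied values receive average ranks; no tie correction
    is applied to the variance, so treat this as a rough sketch.
    """
    pooled = sorted((v, g) for g, sample in enumerate((x, y)) for v in sample)
    values = [v for v, _ in pooled]
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and values[j + 1] == values[i]:
            j += 1
        avg = (i + j) / 2 + 1  # average 1-based rank of the tied block
        for k in range(i, j + 1):
            ranks[k] = avg
        i = j + 1
    n1, n2 = len(x), len(y)
    w = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)  # rank sum of x
    mean_w = n1 * (n1 + n2 + 1) / 2
    sd_w = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w - mean_w) / sd_w
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value
    return z, p

# Invented per-nucleotide reactivities, for illustration only.
native = [0.05, 0.10, 0.08, 0.20, 0.12, 0.07, 0.15, 0.09]
refolded = [0.30, 0.45, 0.25, 0.60, 0.40, 0.35, 0.50, 0.28]
z, p = rank_sum_test(native, refolded)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A negative z here means the first sample ranks lower overall than the second. For real SHAPE profiles with thousands of nucleotides and many ties, an established statistics package would be the safer choice.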

Fig. 1. Cyclin and polarity complexes are spatially and physically
distinct within the cell. (A) (Top) Whi3 forms liquid droplets
in Ashbya gossypii. (Bottom) Whi3 droplets accumulate and fuse around
nuclei. Green arrow denotes polarity droplets. Pink arrows denote
perinuclear droplets. Scale bars, 5 µm.
(B) Mean intensity of Whi3-
(C) Rate of Whi3 incorpora-
(green) mRNAs colocalize with polarity mRNA SPA2 (pink). Nuclei are
in blue. The green arrow marks where the RNAs overlap at the tip.
Inset scale bar, 2 µm. (F) BNI1 and SPA2 are significantly more
colocalized than BNI1 and CLN3. ***P < 0.001 for tips and
**P < 0.01 for nuclei (Fisher's exact test). n = 40 nuclei and tips
for =30 cells.

We hypothesized that secondary structure influences
mRNA sorting, because stem loops may
selectively display or mask sequences capable
of hybridizing with other RNAs. CLN3 contains
five complementary regions to BNI1 (fig. S4A),
most of which had low SHAPE reactivity and
therefore were more structured (fig. S4B),
suggesting that these regions are inaccessible for
hybridizing with BNI1. We hypothesize that
these regions become available to pair with BNI1
when CLN3 is melted, causing the structure-dependent
loss of droplet specificity. To test this
hypothesis, oligonucleotides (i.e., oligos)
complementary to these regions were added to melted
CLN3 and significantly decreased the coassembly
with BNI1, restoring the formation of distinct
CLN3 droplets (Fig. 3, B and C). Additionally,
cln3sm, a mutant perturbing structure and
exposing complementarity, colocalized with BNI1
transcripts in vitro and at polarity sites in cells
(Fig. 3F and fig. S5) (>60% tips colocalized). Thus,
secondary structure can regulate RNA sorting
into distinct droplets through altering the capacity
to form intermolecular interactions.

We next asked whether exposed complementarity
explains coassembly of BNI1 and SPA2
into the same droplets. Indeed, SHAPE-MaP
analysis of BNI1 and SPA2 showed complementary
regions between these colocalizing mRNAs
having significantly higher SHAPE reactivity and
less structure compared with the CLN3/BNI1
regions (figs. S4 and S6) (P < 0.002, t test). Addition
of complementary oligos to these regions disrupted
colocalization in the presence of Whi3 and in
RNA-only reactions (fig. S7, A and B). We predicted that
CLN3 may self-assemble, and indeed cln3 codon, a
CLN3 mutant whose codons have been randomized
but whose Whi3 binding sites remain intact, does
not colocalize with endogenous CLN3 mRNA in
cells, further supporting RNA-RNA interactions
in coassembly of related RNAs (fig. S7C). These
data suggest that RNA-RNA interactions based
on intermolecular hybridization direct RNAs into
the same or different droplets.
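
To make the idea of intermolecular hybridization concrete, the sketch below scans two RNAs for short windows that could pair through perfect antiparallel Watson-Crick matches. It is a toy illustration and not the analysis used in this study: the sequences are made-up placeholders rather than CLN3 or BNI1, the function names and the window length `k` are assumptions, and G-U wobble pairs and folding energetics are ignored.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def reverse_complement(rna):
    """Return the Watson-Crick reverse complement of an RNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))

def complementary_sites(rna_a, rna_b, k=6):
    """List (pos_a, pos_b) pairs where a k-mer of rna_a can pair with rna_b.

    The window of rna_a starting at pos_a is a perfect antiparallel
    Watson-Crick match to the window of rna_b starting at pos_b.
    """
    index = {}
    for j in range(len(rna_b) - k + 1):
        index.setdefault(rna_b[j:j + k], []).append(j)
    sites = []
    for i in range(len(rna_a) - k + 1):
        target = reverse_complement(rna_a[i:i + k])
        for j in index.get(target, []):
            sites.append((i, j))
    return sites

# Toy sequences, not the real CLN3 or BNI1 transcripts.
rna_a = "GGAUCCAUGCAAGU"
rna_b = "CCACUUGCAUGGAA"
print(complementary_sites(rna_a, rna_b, k=6))
```

Whether a complementary window actually pairs would additionally depend on whether it is buried in secondary structure, which is the point the SHAPE data address.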


Does Whi3 protein influence the identity of
droplets? The majority of Whi3 binding sites are
exposed on stem loops in CLN3, BNI1, and SPA2
(Fig. 3E and figs. S8 and S9). Notably, refolding
or scrambling the CLN3 sequence rearranges
the presentation of Whi3 binding sites (Fig. 3E).
Therefore, RNA secondary structure may influence
Whi3 binding and contribute to droplet
composition and immiscibility in addition to
RNA complexing. SHAPE-MaP analysis of CLN3
mRNA in the presence of Whi3 supports that
Whi3 binding sites are occupied by protein (Fig. 4A
and fig. S10A) and revealed that protein binding
causes structural rearrangements (Fig. 4B).
We therefore hypothesize that Whi3 binding
may have important contributions to structural
rearrangements of target RNAs relevant to
droplet identity.

Fig. 2. Polarity and cyclin complexes segregate in vitro. (A) Experimental
schematic of in vitro droplet recruitment assay. (B) CLN3
mRNA (pink) is not efficiently recruited, but BNI1 or SPA2 mRNA (pink) are
recruited into preformed Whi3-BNI1 droplets (green) based on fluorescence
microscopy. Scale bar, 10 µm. (C) Recruitment coefficients of mRNA
from (B). Boxes indicate interquartile range, line is median, whiskers
contain points within three times the interquartile range, and outliers are
indicated with dots. NS, not significant, P > 0.05; **P < 0.01; ***P < 0.001
(t test). n = 500 droplets for N = 3 biological replicates. (D) Cartoon
schematic and representative images showing in vitro RNA-only droplet
assay where CLN3, BNI1, and SPA2 mRNAs assemble into liquid or gel-like
droplets. Scale bar, 5 µm. (E) Fluorescence microscopy images showing
that BNI1 RNA (green) colocalizes with SPA2 RNA (pink) in droplets.
(F) Fluorescence microscopy images showing that CLN3 RNA (pink) does
not colocalize with SPA2 (green) and BNI1 (green) droplets. Scale bar,
5 µm. (G) Quantification of colocalization between BNI1 and SPA2, SPA2
and CLN3, or BNI1 and CLN3 RNAs. NS, not significant, ***P < 0.001
(Wilcoxon rank sum test). n = 200 droplets for N = 3 biological replicates.

Fig. 3. RNA secondary structure determines specificity and identity
of Whi3-CLN3 droplets. (A) Fluorescence microscopy images showing
the recruitment of scrambled (cln3 scr), melted (CLN3 ml), and refolded
CLN3 (CLN3 refold) mRNA (pink) into preformed Whi3-BNI1 droplets
(green). (B) Fluorescence microscopy images showing the loss of recruitment
of CLN3 ml when mixed with oligonucleotides targeting complementary
sequences of CLN3 to BNI1. Scale bar, 10 µm. (C) Quantification
of (A) and (B). *P < 0.05; **P < 0.01; ***P < 0.001 (t test). n = 500 droplets
for N = 3 biological replicates. (D) Base pairing probability from SHAPE-MaP
in vivo. n = 500 droplets for N = 3 biological replicates.

Fig. 4. Whi3 binding alters RNA behavior. (A) Differences in SHAPE
reactivities (ΔSHAPE) were calculated by subtracting CLN3 SHAPE
reactivities from CLN3 + Whi3 reactivities. Positive ΔSHAPE values indicate
protection from modification in the presence of Whi3, and negative
ΔSHAPE reports enhanced reactivity in the absence of Whi3 protein.
(B) Base-pairing probability compared between CLN3 and CLN3 with Whi3
shows rearrangements in CLN3 structure in the presence of Whi3. Arcs
connect base-pairing sites and are colored by probability. (C) Schematic
of smFRET experiment. (D) FRET histograms before (gray) and after
(green) 0.5 or 5 µM Whi3 addition. Purple shaded regions denote high
and mid FRET states for CLN3 and BNI1, respectively. (E) Averaged Cy3
(green), Cy5 (red) intensities, and representative FRET traces (blue)
obtained from smFRET experiments of CLN3 and BNI1 in the presence of
CLN3 than BNI1 in the presence of Whi3. (F) Proposed model in which
RNA-RNA interactions derived from mRNA structure promote the selective
uptake of distinct RNAs and protein constituents into droplets leading to
distinct dynamics (orange zigzags) of different droplet complexes.

To examine the consequence of Whi3 binding
to RNA, we used single-molecule fluorescence
resonance energy transfer (smFRET) (Fig. 4C) to measure the
conformational dynamics of CLN3 and BNI1
mRNAs with and without Whi3 (16). In the
absence of protein, CLN3 RNA showed high FRET
values indicative of a compacted state, whereas
BNI1 RNA showed lower FRET values, indicating
a less compact state (Fig. 4D, purple shaded
regions). Upon addition of Whi3, CLN3 FRET
values decreased, indicating that more extended
RNA conformations were induced, dependent
on the ability of Whi3 to bind mRNA (fig. S10, B
and C). In contrast, bound to Whi3, BNI1 RNA
showed a more substantial broadening of FRET
values (Fig. 4D), indicating that Whi3-BNI1
complexes are more dynamic. Dwell-time analysis
revealed that Whi3-induced dynamics are three
times faster for BNI1 than CLN3 (Fig. 4E).
Different mRNAs thus react differentially in their
intramolecular fluctuations to the presence of Whi3,
providing an additional mode of RNA droplet
regulation.
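
The FRET values discussed above are typically derived from paired donor (Cy3) and acceptor (Cy5) intensities. The sketch below computes the commonly used apparent FRET efficiency, E = I_A / (I_D + I_A), for each time point of a trace. It assumes no leakage, direct-excitation, or detection-efficiency corrections, which this text does not specify; the function names and the intensity values are invented for illustration.

```python
def apparent_fret(donor, acceptor):
    """Apparent FRET efficiency E = I_A / (I_D + I_A) for one time point."""
    total = donor + acceptor
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return acceptor / total

def fret_trace(donor_trace, acceptor_trace):
    """Apply apparent_fret point by point to paired intensity traces."""
    if len(donor_trace) != len(acceptor_trace):
        raise ValueError("traces must have equal length")
    return [apparent_fret(d, a) for d, a in zip(donor_trace, acceptor_trace)]

# Invented intensities (arbitrary units), for illustration only.
cy3 = [200.0, 250.0, 600.0, 650.0]
cy5 = [800.0, 750.0, 400.0, 350.0]
print([round(e, 2) for e in fret_trace(cy3, cy5)])
```

Higher values correspond to the dyes being closer together, which is why a compacted RNA reads as high FRET and a more extended one as lower FRET.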

These FRET studies suggest that Whi3 bind- 
ing alters the conformational dynamics of target 
RNAs. We speculate that these differential dy- 
namics help maintain droplet identities estab- 
lished by RNA-RNA interactions. Once RNA-RNA 
interactions are formed, Whi3 binding may re- 
duce the ability of the RNA to resample many 
alternate RNA structures to maintain the identity.
Additionally, the slower fluctuations of CLN3
bound to Whi3 may be one source of exclusion
from the more rapidly fluctuating BNI1-Whi3
complexes in those droplets. Such dynamics may drive
the droplet material properties reported previously
(3) and serve as barriers to homogenization.

We show that mRNA structure defines the 
ability of an RNA to engage in homo- or hetero- 
meric interactions and thus drives specificity in 
the composition of liquid droplet compartments 
(Fig. 4F). This mechanism is likely relevant for 
the sorting of specific RNAs to other RNA granules,
such as stress granules, P granules, and P bodies
(17, 18). Future work will address the timing and
location of how mRNA secondary structure in- 
fluences selective uptake of cellular constituents 
into droplets. Protein binding to different RNAs 
can lead to varied dynamics of complexes, further 
distinguishing the physical properties of different 
droplets and promoting immiscibility of coexist- 
ing droplets. Given the large number of distinct, 
RNA-based condensates in the cell, these mech- 
anisms are likely broadly relevant to explain how 
droplets achieve and maintain individuality. 
LIFE SCIENCE TECHNOLOGIES


new products 


2-Minute Next-Gen PCR 
Introducing 2-minute PCR— 
NEXTGENPCR is redefining “fast.” 

The NEXTGENPCR device heats and 
cools samples instantly, losing no 
time getting samples to the desired 
temperature. Go from melting to 
annealing in less than one-tenth of a 
second. Complete PCR in as little as 2 
minutes. Using one thermocycler, you 
can run more than 10 plates per hour. The math adds up quickly—10 
plates per hour for 8 hours in a 384-well format means more than 
30,000 data points in a single day. Process more samples per day for 
sequencing, genotyping, and other PCR applications by incorporating 
NEXTGENPCR into your laboratory. 

Canon BioMedical 

For info: 844-226-6624 

www.canon-biomedical.com/gofaster 
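
The throughput figure quoted in the NEXTGENPCR description above can be checked with a few lines of arithmetic. This sketch assumes that every well yields exactly one data point and that plates are run back to back with no downtime, neither of which is stated in the product description.

```python
plates_per_hour = 10    # "more than 10 plates per hour"
hours_per_day = 8       # one working day
wells_per_plate = 384   # 384-well format

# Lower bound on data points per day under the assumptions above.
data_points_per_day = plates_per_hour * hours_per_day * wells_per_plate
print(data_points_per_day)
```

The product comes to 30,720, consistent with the quoted "more than 30,000 data points in a single day."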


Antibody Labeling Kits 

Mix-n-Stain antibody labeling kits are a revolutionary technology
that dramatically simplifies the process of antibody labeling. The
kits enable a researcher to covalently label <5 µg to 100 µg of an
antibody with one of 20 super bright and photostable CF dyes,
biotin, or fluorescein isothiocyanate (FITC) in only 30 minutes, with
100% recovery of antibody after labeling. The prepared conjugates
perform comparably to prelabeled antibodies from leading suppliers.
Because the labeling is covalent, the conjugates are stable for
long-term storage, and ideal for multicolor imaging. Each kit comes with
everything you need to perform the conjugation. Simply mix your
antibody with a vial of premeasured CF dye, biotin, or FITC in the
buffer provided, a step taking less than 30 s. After waiting for
30 min, without a separation step, you will have an optimally
labeled antibody conjugate ready for use in any
immunofluorescence staining experiment.

Biotium 

For info: 800-304-5357 

www.biotium.com 


Trypsin For Mass Spectrometry 

MilliporeSigma’s Advanced Proteomics Grade SOLu-Trypsin is
an exclusive, solution-stable enzyme for mass spectrometry.
Designed to be stable in solution when refrigerated, it can be
used immediately without preparation. Other forms of trypsin
require thawing or reconstitution and must be discarded if not
used immediately. SOLu-Trypsin allows excess product to be saved
for future use, thus eliminating unnecessary waste and cost. It
is formulated with a high-purity recombinant trypsin, free of
chymotryptic activity, to ensure high-fidelity digestion, and is stable
for short-term use at room temperature in an autosampler or on a
liquid-handling robot.

MilliporeSigma 

For info: 800-325-3010 

www.sigma.com/solu-trypsin 
Hematopoietic Progenitor Cell Culture Medium

Irvine Scientific’s PRIME-XV Hematopoietic Cell Basal Medium is a
serum-free, xeno-free basal medium that specifically supports robust
expansion of hematopoietic progenitors while maintaining their
multipotency, the functional property they need for cell-based
therapies. Hematopoietic stem and progenitor cells (HSPCs) give
rise to all other blood cells, and therefore have therapeutic potential
for the treatment of cancers such as leukemia and lymphoma, and
nonmalignant diseases such as sickle-cell anemia or immune-related
disorders. With numbers of cells in the hundreds of millions required
for therapy, the challenge has been obtaining sufficient numbers
of HSPCs that maintain the progenitor cell properties required to
differentiate and mature into the various hematopoietic lineage cells
needed for therapies. PRIME-XV is manufactured under Current Good
Manufacturing Practices using stringent raw-material qualification,
and offers consistency and reliability while supporting the move from
research toward clinical applications.

Irvine Scientific 

For info: 800-577-6097 

www.irvinesci.com


Bioreactor System 

The BioFlo 120 is a bench-scale bioreactor/fermentor system perfectly
suited for all levels of R&D. Flexibly designed to meet the wide-ranging
needs of today’s scientists, the system features 24 interchangeable
heat-blanketed and water-jacketed autoclavable vessels, along with BioBLU
Single-Use Vessels ranging from 250 mL to 40 L working volume. Digital
Mettler Toledo ISM (Intelligent Sensor Management) sensor technology
is fully integrated, allowing the user to choose between pH, redox,
dissolved oxygen (polarographic and/or optical), and CO2 measurements
without the need for additional equipment. A high-precision,
expanded-range thermal mass flow controller is also available, which can provide
low-flow mammalian cell culture through high-demand microbial
processes on a single controller. Its ease of use and flexibility ensure that
the BioFlo 120 will serve scientists for years to come, be it in academic,
governmental, or industrial research settings.

Eppendorf 

For info: 800-645-3050 

www.eppendorf.com/bioflo120 


GC/Q-TOF System 

The Agilent 7250 GC/Q-TOF system combines gas chromatography (GC)
and quadrupole time-of-flight mass spectrometry (Q-TOF) with a
low-energy electron ionization (EI) source that allows greater exploration of
unknown chemical samples. The system is designed for laboratories
doing food and environmental testing, life science research, forensics, and
chemical analysis. It represents the only commercially available
combination of a high-resolution, accurate-mass GC/MS and low-energy
ionization source, enabling scientists to employ analytical techniques that were
previously impractical, or in some cases impossible. The 7250 GC/Q-TOF
system will now permit labs to quickly and easily identify volatile and
semivolatile compounds that once required them to perform alternative
techniques or longer sample preparations to detect and determine the
compounds present in their samples. Furthermore, with low-energy EI,
scientists will be better able to elucidate chemical structures.

Agilent Technologies 

For info: 800-227-9770 

www.agilent.com 
Apply for the Science & SciLifeLab Prize for Young Scientists,
an annual prize awarded to early-career scientists. The prize is
presented in four categories: Cell and Molecular Biology, Genomics
and Proteomics, Ecology and Environment, and Translational Medicine.

The winners will have their essays published by Science, win up
to USD 30,000 and be invited to a week in Sweden to attend
the award ceremony. Get ready for a life-changing moment in
your scientific career.
ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE

Faculty Position in Computational Environmental Sciences and Engineering
at the Ecole polytechnique fédérale de Lausanne (EPFL)


The EPFL School of Architecture, Civil and Environmental Engineering (ENAC) invites 
applications for a faculty position in its Institute of Environmental Engineering (IIE), either at 
the tenure track level (Assistant Professor) or the tenured level (Associate or Full Professor). The 
appointee will join the newly formed EPFL Centre for Changing Alpine and Polar Environments 
(CAPE), based in Sion, Switzerland, and contribute to research and teaching activities within 
IIE. This appointment is one of several CAPE professorships, and offers unrivalled collaboration 
opportunities at the local and European levels. 


Analogously to the early days of computational biology, computational environmental sciences 
and engineering are now on the rise. The role of computation in understanding changing 
environments is pervasive, being central to, for example, process-based models, design of 
sensor networks, and big data analysis and visualization. The research vision of CAPE includes 
developing understanding of connections between models across scales, from global earth system 
simulation to small-scale ecosystem functioning. The appointee will investigate multiscale 
analysis of environmental systems found at high latitudes or high elevations. Research foci of 
interest include, but are not limited to, development of multiscale models linked with widespread 
environmental data, sensor networks, biogeochemical fluxes across scales, hydrological and 
ecological networks, and data-driven environmental system modelling. 


We seek an outstanding individual who will lead an internationally recognized research program 
that extends and leverages the opportunities offered by CAPE/EPFL. The professor will be 
committed to excellence in research and in undergraduate and graduate level teaching, and will 
contribute to the teaching program in Environmental Engineering at EPFL, which views basic 
and translational research as the foundation for environmental adaption and engineering design. 


With its main campus located in Lausanne and its developing antennae in neighbouring 
cantons in Switzerland, EPFL is a growing and well-funded institution fostering excellence 
and diversity. It is well equipped with experimental and computational infrastructure, and 
offers a fertile environment for research collaboration between different disciplines. The EPFL 
environment is multilingual and multicultural, with English serving as a common interface. 
EPFL offers internationally competitive start-up resources, salaries, and benefits. It is committed 
to increasing the diversity of its faculty, and strongly encourages women to apply. 


The following documents are requested in PDF format: cover letter including a statement of 
motivation, curriculum vitae including explicit mention of career breaks, publications list, 
concise statements of research and teaching interests (3-5 pages) as well as the names and 
addresses, including emails, of at least three references for junior positions or five references 
for senior positions (may be contacted at a later stage). Applications should be uploaded to the 
EPFL recruitment web site: 


https://facultyrecruiting.epfl.ch/position/10977282 


Formal evaluation of the applications will begin on September 1, 2018 and the search will 
continue until the position is filled. 


Further enquiries should be made to: 
Prof. D. Andrew Barry 

Chair of the Search Committee 

E-mail: searchenvironmental@epfl.ch 


For additional information on EPFL: 
http://www.epfl.ch, http://enac.epfl.ch; https://valais.epfl.ch/Home 


EPFL is an equal opportunity employer and a family friendly university. 
University of Pittsburgh
Graduate School of Public Health 
Department of Epidemiology 


ASSOCIATE/ASSISTANT PROFESSOR OF EPIDEMIOLOGY 
Infectious Diseases 


The Department of Epidemiology, Graduate School of Public Health 
(GSPH), University of Pittsburgh, Infectious Diseases Epidemiology 
Research Program is recruiting tenure stream faculty at the Associate 
or Assistant Professor levels with expertise in infectious diseases 
epidemiologic research. Opportunities are available for collaboration 
with the Pitt Center for Global Health, Pitt clinical research sites in 
Mozambique and South Africa, and Center for Vaccine Research. GSPH 
regularly ranks among the highest in NIH funding. The Department of 
Epidemiology has a strong research portfolio, collaborating locally with 
other institutions across the University, nationally, and internationally. 
Initial salary support, start-up funds, and laboratory space (if needed) 
will be provided. 


Candidates should have a doctoral degree (MD and/or PhD) and at 
least 3 years postdoctoral experience with a demonstrated record of 
recent research accomplishments and sufficient evidence to establish an 
independent, funded research program. Salary and academic rank will 
be commensurate with experience. 


Applications will be accepted until positions are filled. Send letter of 
intent, curriculum vitae, statement of research interests, and the names 
of three references by E-mail to: Position #0124922, c/o D. Bushey, 
Department of Epidemiology, University of Pittsburgh, A528 
Crabtree Hall, 130 DeSoto Street, Pittsburgh, PA 15261; E-mail: 
dlb22@pitt.edu.


EEO/AA/M/F/Vets/Disabled. 


Case Western Reserve University School of Medicine

Open Rank Faculty Position (tenure-track/tenured), Dept. of Molecular
Biology and Microbiology, Case Western Reserve University
(CWRU), Part of the CWRU - Cleveland VAMC Center for Antimicrobial
Resistance and Epidemiology (Case VA CARES). We are seeking applications
for a 12-month tenure track or tenured position at the Assistant, Associate,
or Full Professor level. Case VA CARES aims to strengthen and expand the
research program targeted at understanding mechanisms of antibiotic resistance.
We encourage applications from individuals with demonstrated experience in:
molecular basis of antimicrobial action and resistance, alternative approaches
to combating infections, animal models of infection with MDR organisms, as
well as molecular epidemiology of MDR infections and bioinformatics.


Successful candidates will establish a vigorous research program, teach, 
and interact with a nationally-ranked team of scientists interested in AMR and
infectious diseases. A Ph.D. or M.D. degree, as well as at least 3 years of 
postdoctoral training are required. Candidates at the Assistant Professor level 
must have demonstrated a capacity for independent research and exhibit a high 
likelihood of attracting federal funding. At the Associate and Full Professor 
level, candidates should have a record of federal funding, an active research 
program, and evidence of a national/international reputation. Rank and salary 
will be commensurate with experience. 


The University offers partner benefits and is responsive to the needs of dual- 
career couples. Applicants must email a letter of application, CV, brief statement 
of research goals and accomplishments, and 3 references as PDF files to: 
MBIO-VAMC Search Committee (VAMBIO205@CASE.EDU). Only complete 
applications will be considered. 


In employment, as in education, Case Western Reserve University is 
committed to Equal Opportunity and Diversity. Women, veterans, members 
of underrepresented minority groups, and individuals with disabilities 
are encouraged to apply. Case Western Reserve University provides 
reasonable accommodations to applicants with disabilities. Applicants 
requiring a reasonable accommodation for any part of the application and 
hiring process should contact the Office of Inclusion, Diversity and Equal 
Opportunity at 216-368-8877 to request a reasonable accommodation. 
Determinations as to granting reasonable accommodations for any applicant 
will be made on a case-by-case basis. 
ECOLE POLYTECHNIQUE
FEDERALE DE LAUSANNE 


EPFL’s School of Architecture, Civil and Environmental 
Engineering (ENAC) and the Swiss Federal Institute 
for Forest, Snow and Landscape Research (WSL) invite 
applications for either a tenure-track (Assistant) or 
tenured (Associate or Full) Professor in the Institute of 
Environmental Engineering. The appointee will lead the 
Swiss Romande WSL Site, and contribute to research 
and teaching activities within the EPFL Institute of 
Environmental Engineering. 


The Institute of Environmental Engineering (IIE) in ENAC
carries out basic and translational research spanning 
fundamental understanding of environmental systems and 
their resilience to design of adaption strategies. It covers 
a diverse portfolio in research, teaching and innovative 
technology development, including: Climate Change 
and Geochemical Cycles, Hydrology, Hydrodynamics, 
Precipitation, Water Quality, Bioremediation, 
Ecotoxicology, Air Quality, Renewable Energy, City 
and Landscape Development, Ecosystems, Ecology 
and Robotics. These research themes are underpinned 
by extensive facilities in, e.g., chemical, microbial and 
isotopic analysis. This professorship, along with the WSL 
Swiss Romande site, is located at EPFL’s main campus 
at Lausanne. While IIE as a whole is mainly based in 
Lausanne, it is currently undergoing expansion at the 
EPFL Sion campus, with the creation of the new Centre 
for Changing Alpine and Polar Environments. 


WSL is the national Swiss organization for basic and 
applied research into use and management of natural 
resources including forests, landscapes, and biodiversity. 
It focuses on use and protection of resources and the 
management of risks of natural hazards. Together with 
civic and governmental bodies, WSL seeks sustainable 
solutions to societally relevant questions. 


The joint WSL/EPFL professor in terrestrial ecology 
will have acknowledged strengths in research related to 
forests, biodiversity and ecosystem services in dynamic 
environments. Areas of interest within these domains 
include, but are not limited to: functional biodiversity 
in different settings (urban to forested), responses and 
adaption of ecosystems to global changes, management 
of ecosystem services at local-to-global scales, plant-soil 
interactions, and ecosystem climate responses. 


Faculty Position in Terrestrial Ecology

Joint Appointment between the Swiss Federal Institute for
Forest, Snow and Landscape Research (WSL) and
the Ecole polytechnique fédérale de Lausanne (EPFL)

The WSL/EPFL professor will lead an internationally
recognized research program that leverages the opportunities
offered by WSL and EPFL. The professor will support WSL
research and outreach activities, and enhance the group’s
integration into teaching and transdisciplinary research
within ENAC. The appointee will promote excellence in
research and in undergraduate and graduate level teaching.


The WSL/EPFL Professor of Terrestrial Ecology will 
be located at EPFL. EPFL is a growing and well-funded 
institution fostering excellence and diversity, with a highly 
international campus at an attractive location with excellent 
experimental and computational infrastructure. Teaching 
and research at EPFL covers essentially the entire palette 
of engineering and science, and offers a fertile environment 
for research collaboration between different disciplines. 
The EPFL environment is multilingual and multicultural, 
with English serving as a common interface. It is committed
to increasing the diversity of its faculty, and strongly 
encourages women to apply. 


The following documents are requested in PDF format: cover 
letter including a statement of motivation, curriculum vitae 
including explicit mention of career breaks, publications 
list, concise statements of research and teaching interests 
(3-5 pages) as well as the names and addresses, including 
emails, of at least three references for junior positions or 
five references for senior positions (may be contacted at a 
later stage). 

Applications should be uploaded to the EPFL recruitment 
web site 


https://facultyrecruiting.epfl.ch/position/10977284 


Formal evaluation of the applications will begin on 
August 15, 2018 and the search will continue until the 
position is filled. 


Further enquiries should be made to: 
Prof. D. Andrew Barry 

Chair of the Search Committee 
E-mail: SearchTerrestrialEco@epfl.ch 


For additional information on WSL and EPFL: 
http://wsl.ch, http://www.epfl.ch, http://enac.epfl.ch 


EPFL is an equal opportunity employer and a family
friendly university. 
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Register for a free online account on 
ScienceCareers.org. 


Search thousands of job postings and find 
your perfect job. 


Sign up to receive e-mail alerts about job 
postings that match your criteria. 


Upload your resume into our database and 
connect with employers. 


Watch one of our many webinars on 
different career topics such as job 
searching, networking, and more. 


Download our career booklets, including 
Career Basics, Careers Beyond the Bench, 
and Developing Your Skills. 
Complete an interactive, personalized 
career plan at “my IDP.” 


Visit our Career Forum and get advice from 
career experts and your peers. 


Research graduate program information 
and find a program right for you. 


Read relevant career advice articles from 
our library of thousands. 
Professor of Solid State 
NMR Spectroscopy 


The Department of Chemistry and Applied Biosciences (www.chab.ethz.ch) 
at ETH Zurich invites applications for the above-mentioned position. The 
professorship should have a research focus on NMR spectroscopy of the 
solid state with possible applications in either the life sciences or materials 
research. Candidates are expected to establish a research programme at the 
forefront of method development in their field. Collaboration with other groups 
at ETH in application work or method development is strongly encouraged, 
and teaching in all areas of physical chemistry is expected. In general, at 
ETH Zurich undergraduate-level courses are taught in German or English and 
graduate-level courses are taught in English. 


Please apply online: www.facultyaffairs.ethz.ch 


Applications should include a curriculum vitae, a list of publications, a 
statement of future research and teaching interests, and a description of 
the three most important achievements. The letter of application should be 
addressed to the President of ETH Zurich, Prof. Dr. Lino Guzzella. The closing 
date for applications is 31 August 2018. ETH Zurich is an equal opportunity 
and family friendly employer and is responsive to the needs of dual career 
couples. We specifically encourage women to apply. 


Advance your career 
with expert advice from 
Science Careers. 


Download Free Career Advice Booklets! 
ScienceCareers.org/booklets 


Featured Topics: 
Industry or Academia 
Job Searching 
Max Planck 
Institute for 
Terrestrial 
Microbiology 


The Max Planck Institute for Terrestrial 
Microbiology announces the position of 


Department Director 


The Max Planck Institute for Terrestrial 
Microbiology in Marburg, Germany invites 
expressions of interest from or nominations 
of outstanding scientists for the position of 
Director. A Max Planck Director pursues a 
bold long-term research agenda in his or 
her own Department, and leads the Institu- 
te jointly with the fellow Directors. We seek 
a director with a high international reputa- 
tion and a proven track record in research 
areas that complement, strengthen or ex- 
pand the research agenda of the Institute 
(for details, please see http://www.mpi- 
marburg.mpg.de/), preferably with a focus 
on interactions between microorganisms, 
on interactions of microorganisms with 
their environment or with their hosts. Chief 
criteria for a directorship are scientific and 
scholarly excellence, creativity, and an inno- 
vative and unique research program, rather 
than seniority or publication rankings. 


We offer an excellent and international 
research environment, outstanding infra- 
structure, and long-term scientific funding. 
The working language is English and qua- 
lified individuals of all nationalities will be 
considered. 


The Max Planck Society is deeply commit- 
ted to the principles of gender equality, di- 
versity, and inclusiveness and welcomes ex- 
pressions of interest from all backgrounds. 
We actively encourage individuals with 
disabilities to submit an expression of in- 
terest. The Max Planck Society is aiming to 
increase the proportion of women among 
its scientific leadership. We therefore strongly 
encourage expressions of interest from 
and the nominations of women. 


Both expressions of interest and nomina- 
tions are welcome. Expressions of interest 
must consist of a letter of intent, a full cur- 
riculum vitae incl. a complete publication 
list, a brief summary of previous research 
accomplishments and a brief concept 
statement outlining the structure and 
long-term, visionary research agenda of the 
Department. Further information can be 
obtained from the managing director Prof. 
Dr. Victor Sourjik. 


Expressions of interest and nominations 
will be held in strictest confidence and 
should be sent by email 
as a single pdf file to 
director@mpi-marburg.mpg.de 


Expressions of interest and 
nominations are due by 
June 22nd 2018. 


Short-listed candida- 
tes will participate in a 
search symposium in 
Marburg in November 
2018. 


MAX-PLANCK-GESELLSCHAFT 


WORKING LIFE 


By Moamen Elmassry 
What I learned from teaching 


When I started my Ph.D. program, 3 years ago, I was excited to conduct cutting-edge research. 
But I wasn’t so sure about the teaching that I would have to do every semester to fund my 
education. I saw it as a waste of time and energy that I could otherwise spend in the lab. The 
stereotype of the teaching assistant (TA) rushing between classes and spending weekends 
grading while guzzling coffee did not help. I had no experience teaching, and the idea of 
being in front of a class made me anxious. But if you ask me now, I would say that teaching has been 
one of the most rewarding parts of my Ph.D. experience. 


My first semester was tough. In ad- 
dition to getting my research going, 
I was adjusting to living in a new 
country—I had just moved to the 
United States from my home coun- 
try of Egypt. Adding teaching on 
top of that, with just a brief training 
workshop to prepare me, was not a 
recipe for success. I tried my best 
to help my students learn, but my 
inexperience was apparent. 

I could have carried on as a 
mediocre teacher. But I recalled 
how some of my own teachers had 
inspired me over the years. I felt I 
owed my students the same—which, 
I realized, would require time and 
training. It was my responsibility to 
make that happen, even if it meant 
taking a little more time and focus 
away from my research. 

I started with the online Science 
Teaching Fellows Program from the American Society for 
Microbiology. I also began attending workshops and train- 
ing sessions at my university’s Teaching, Learning, and 
Professional Development Center (TLPDC). I loved being 
part of a supportive community and learned some valu- 
able teaching strategies, which I experimented with in my 
second semester of teaching. For example, I introduced my 
students to epidemiology by asking them to write short 
stories about an epidemic spreading on campus, hoping to 
incorporate more creativity into their learning. This uncon- 
ventional assignment surprised the students at first. But 
some of them got so into it that they wrote much more than 
the half page I had assigned. I loved seeing my students 
so engaged with an activity I had designed. In my end-of- 
semester evaluations, some students said that I was their 
favorite TA, and others asked me to write recommendation 
letters for them, which was both humbling and rewarding. 

I was eager to continue my teacher training. I enthusiasti- 
cally proposed to my adviser that I apply for a TLPDC pro- 
gram that requires 30 to 40 hours 
over the course of a semester. I was 
disappointed when he said I hadn’t 
made enough progress on my disser- 
tation research, but I also acknowl- 
edged that he was probably right. 
I found an alternate program that 
would still help me grow as an edu- 
cator without taking as much of my 
time. And I later made enough prog- 
ress on my research to participate in 
the original program—with my ad- 
viser’s approval—the following year. 

I am now focused on establishing 
a balance between my primary job— 
teaching, which is what I am paid 
for—and my primary goal: complet- 
ing my doctoral degree and pursu- 
ing a career in academia. As I work 
to find this balance, I also remind 
myself—and my adviser and thesis 
committee members—that teaching 
has provided me with some unexpected benefits. Knowing 
that I have teaching commitments pushes me to conduct 
efficient, well-designed experiments. Answering undergrad- 
uate students’ fundamental “why” questions helps keep me 
intellectually stimulated and forces me to think about sci- 
ence in new ways, which was useful during my Ph.D. candi- 
dacy exam. I also realized recently that my occasional bouts 
of near depression, triggered by failed experiments, hit only 
during summer and winter breaks, when I do not teach. If 
my research goes badly while I am teaching, at least I can see 
that my work is paying off for my students. 

If you're viewing your teaching assignments with dread, 
I encourage you instead to approach them with an open 
mind and a willingness to learn. Maybe, like me, you'll find 
that teaching is full of unexpected rewards. 


Moamen Elmassry is a Ph.D. candidate at Texas Tech 
University in Lubbock. Do you have an interesting career 
story? Send it to SciCareerEditor@aaas.org. 